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Cell-Cycle Regulatory Proteins from Human 
Pathogens, and Uses Related Thereto 



Background of the Invention 
5 The progression of a proliferating eukaryotic cell through the cell-cycle checkpoints 

is controlled by an array of regulatory proteins that guarantee that mitosis occurs at the 
appropriate time. Protein phosphorylation is the most conmion post-translational 
modification that regulates processes inside the cells, and a large number of cell cycle 
transitions are regulated by, in addition to protein-protein interactions, the phosphorylation 

10 states of various proteins. In particular, the execution of various stages of the cell-cycle is 
generally believed to be under the control of a large number of mutually antagonistic kinases 
and phosphatases. A paradigm for these controls is the CDC2 protein kinase, a cyclin- 
dependent kinase (CDK) whose activity is required for the triggering of mitosis in eukaryotic 
cells (for reviews, see Hunt (1989) Curr Opin. Cell Biol, 1:268-274; Lewin (1990) Cell 

15 61:743-752; and Nurse (1990) Nature 344:503-508). During mitosis, the CDC2 kinase 
appears to trigger a cascade of downstream mitotic phenomena such as metaphase alignment 
of chromosomes, segregation of sister chromatids in anaphase, and cleavage furrow 
formation. Many target proteins involved in mitotic entry of the proliferating cell are directly 
phosphorylated by the CDC2 kinase. For instance, the CDC2 protein kinase acts by 

20 phosphorylating a wide variety of mitotic substrates involved in regulating the cytoskeleton 
of cells, such that entry into mitosis is coordinated with dramatic rearrangement of 
cj-toskeletal elements. 

The CDC2 kinase is subject to multiple levels of control. One well-characterized 
mechanism regulating the activity of CDC2 involves the phosphorylation of tyrosine, 

25 threonine, and serine residues; the phosphorylation level of which varies during the cell-cycle 
(Krekk et al. (1991) EMBO J. 10:305-316; Draetta et al (1988) Nature 336:738-744; 
Dunphy et al (1989) Cell 58:181-191; Moria et al (1989) Cell 58:193-203; Gould et aL 
(1989) Nature 342:39-45; and Solomon et al (1990) Cell 63:1013-1024). The 
phosphorylation of CDC2 on Tyr-15 and Thr-14, two residues located in the putative ATP 

30 binding site of the kinase, negatively regulates kinase activity. This inhibitory 
phosphorylation of CDC2 is mediated at least in part by the weel and mikl tyrosine kinases 
(Russel et al (1987) Cell 49:559-567; Lundgren et al (1991) Cell 64:1111-1122; 
Featherstone et al (1991) Nature 349:808-811; and Parker et al (1992) PNAS 89:2917- 
2921). These kinases act as mitotic inhibitors, over-expression of which causes cells to arrest 

35 in the G2 phase of the cell-cycle. By contrast, loss of function of weel causes a modest 
advancement of mitosis, whereas loss of both weel atnd mikl function causes grossly 
premature mitosis, uncoupled from all checkpoints that normally restrain cell division 
(Lundgren et al (1991) Cell 64:1 11 1-1 122). 
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Summary of the Invention 
The present invention provides reagents and assays which permit rapid detection and 
evaluation of Candida yeast infections without employing cuituring, incubation, sub- 
culturing or microscopic examination. 
5 The present invention also makes available reagents and 2issays for identifying 

compounds which have antifungal properties and which may be used as anti-mycoiic agents. 
Such agents developed with the subject assays can be used therapeutically, as well as, for 
example, preservatives in foodstuff, feed supplement for promoting weight gain in livestock, 
or in disinfectant formulations for treatment of non-living matter, e.g., for decontaminating 

10 hospital equipment and rooms. 

In particular, the present invention relates to the discovery of novel cell-cycle 
regulatory proteins from animal pathogens, particularly from members of the genus Candida, 
One aspect of the invention features a Candida TYPl poljqDeptide, preferably a substantially 
pure preparation of a TYPl polypeptide, or a recombinant TYPl polypeptide. The TYPl 

15 protein shares certain features which suggest that it is a homolog to the S. Pombe cdc25 
phosphatase. In preferred embodiments, the biological activity of the polypeptide includes 
the ability to specifically bind a cyclin dependent kinase (CDK). Preferably, the TYPl 
polypeptide has a phosphatase activity, e.g. a phosphotyrosine phosphatase activity, e.g. a 
phosphoserine/phosphoihreonine phosphatase activity. The TYPl polypeptide may also 

20 generally be characterized as having an amino acid sequence at least 60%, 80%, 90% or 95% 
homologous to the amino acid sequence in SEQ ID No: 7, in addition to those forms of the 
polypeptide which comprise an amino acid sequence identical to the polypeptide designated 
by SEQ ID No: 7. Preferred TYPl polypeptides are at least 5, 10, 20, 50, 100, or 150 amino 
acids in length; e.g., the polypeptide comprises at least 5, preferably at least 10, more 

25 preferably at least 20, more preferably at least 50, 1 00, or 1 50 contiguous amino acids from 
SEQ ID No: 7. Moreover, the subject TYPl polypeptides can either mimic (agonize) or 
inhibit (antagonize) the biological activity of the wild-type form of the phosphatase, e.g., of 
its ability to. regulate Candida cell proliferation. In . preferred embodiments, the TYPl 
polypeptide is isolated or is a recombhinant form of a gene expressed by one of Candida 

30 albicans, Candida stellatoidea, Candida tropicalis, Candida parapsilosis, Candida krusei, 
Candida pseudo tropical is, Candida quillermondii, Candida glabrata, Candida lusianiae, or 
Candida rugosa. 

Another aspect of the invention features a Candida CKSl polypeptide, preferably a 
substantially pure preparation of a CKSl polypeptide, or a recombinant CKSl polypeptide. 
35 In preferred embodiments, the biological activity of the polypeptide includes the abilit>' to 
specifically bind a cyclin dependent kinase (CDK). Preferably, the CKSl polypeptide 
modulates the kinase activity of a CDK. The CKSl polypeptide may generally be 
characterized as having an amino acid sequence at least 60%, 80%, 90% or 95% homologous 
to the amino acid sequence in SEQ ID No: 8, in addition to those forms of the polypeptide 
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which comprise an amino acid sequence identical to the polypeptide designated by SEQ ID 
No: 8. Prefeired CKSl polypeptides are at least 5, 10, 20, 50 or 75 amino acids in length; 
e.g., the polypeptide comprises at least 5, preferably at least 10, more preferably at least 20,' 
more preferably at least 50 or 75 contiguous amino acids from SEQ ID No: 8. Moreover, the' 
5 subject CKSl polypeptides can either mimic (agonize) or inhibit (antagonize) the biological 
activity of the wild-type fonn of the protein, e.g., of its ability to regulate Candida cell 
proliferation. In preferred embodiments, the CKSl polypeptide is isolated from one of 
Candida albicans, Candida stellatoidea, Candida tropicalis, Candida parapsilosis, Candida 
krusei, Candida pseudotropicalis, Candida quiUermondii, Candida glabrata. Candida 
10 lusianiae, or Candida rugosa. 

Another aspect of the invention features a Candida CDKl polypeptide, preferably a 
substantially pure preparation of a CDKl polypeptide, or a recombinant CDKl polypeptide. 
In preferred embodiments, the biological activity of the polypeptide includes the ability to 
specifically bind a cyclin. Preferably, the CDKl polypeptide has an intrinsic kinase activity, 
15 which may depend on formation of a complex with a cyclin. The CDKl polypeptide may 
generally be characterized as having an amino acid sequence at least 60%, 80%, 90% or 95% 
homologous to the amino acid sequence in SEQ ID No: 9, in addition to those' forms of the 
pobTJeptide which comprise an amino acid sequence identical to the polypeptide designated 
by SEQ ID No: 9. Preferred CDKl polypeptides are at least 5, 10, 20, 50. 100 or 150 amino 
20 acids in length; e.g., the polypeptide comprises at least 5, preferably at least 10, more 
preferably at least 20, more preferably at least 50, 100 or 150 contiguous amino acids from 
SEQ ID No: 9. Moreover, the subject CDKl polypeptides can eitiier mimic (agonize) or 
inhibit (antagonize) the biological activity of the wild-type form of the kinase, e.g., of its 
ability to regulate Candida cell proliferation. In preferred embodiments, the CDKl 
25 polypeptide is -isolated from one of Candida albicans, Candida stellatoidea, Candida 
iropicalis, Candida parapsilosis, Candida krusei, Candida pseudotropicalis, Candida 
quiUermondii. Candida glabrata. Candida lusianiae, or Candida rugosa. 

Another aspect of the invention features a Candida CYBl polypeptide, preferably a 
substantially pure preparation of a CYBl polypeptide, or a recombinant CYBl polypeptide. 
30 In preferred embodiments, the biological activity of the polypeptide includes tiie ability to 
specifically bind a cyclin dependem kinase (CDK). Preferably, the CYBl polypeptide 
modulates the kinase activity of a CDK. The CYBl polypeptide may generally be 
characterized as having an amino acid sequence at least 60%, 80%, 90% or 95% homologous 
to the amino acid sequence in SEQ ID No: 10, in addition to those forms of tiie polypeptide 
35 which comprise an amino acid sequence identical to the polypeptide designated by SEQ ID 
No: 10. Preferred CYBl polypeptides are at least 5, 10, 20, 50, 100 or 150 amino acids in 
length; e.g., tiie polypeptide comprises at least 5, preferably at least 10, more preferably at 
least 20, more preferably at least 50, 100 or 150 contiguous amino acids from SEQ ID No: 
10. Moreover, the subject CYBl polypeptides can eitiier mimic (agonize) or inhibit 
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(antagonize) the biological activity of the wild-type form of the protein, e.g., of its ability to 
regulate Candida cell proliferation. In preferred embodiments, the CYBl polypeptide is 
isolated from one of Candida albicans^ Candida stellatoidea^ Candida tropicalis^ Candida 
parapsilosis, Candida krusei^ Candida pseudotropicalis, Candida quillermondii, Candida 
5 glabra! a, Candida lusianiae, or Candida rugosa. 

Another aspect of the invention features a Candida MOCl polypeptide, preferably a 
substantially pure preparation of a MOCl polypeptide, or a recombinant MOCl polypeptide. 
In preferred embodiments, the biological activity of the polypeptide includes the ability to 
specifically bind a cyclin dependent kinase (CDK). Preferably, the MOCl polj^eptide has a 

10 kinase activity, e.g. a serine/theronine kinase activity. The MOCl polypeptide may also 
generally be characterized as having an amino acid sequence at least 60%, 80%, 90% or 95% 
homologous to the amino acid sequence in SEQ ID No: 1 1, in addition to those forms of the 
polypeptide which comprise an amino acid sequence identical to the polypeptide designated 
by SEQ ID No: 11. Preferred MOCl polypeptides are at least 5, 10, 20, 50, 100, or 150 

15 amino acids in length; e.g., the polypeptide comprises at least 5, preferably at least 10, more 
preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from 
SEQ ID No: 1 1. Moreover, the subject MOCl polypeptides can either mimic (agonize) or 
inhibit (antagonize) the biological activity of the wild-type form of the kinase, e.g., of its 
ability to regulate Candida cell proliferation. In preferred embodiments, the MOCl 

20 polypeptide is isolated from one of Candida albicanSy Candida stellatoidea, Candida 
tropicaliSj Candida parapsilosis^ Candida kruseiy Candida pseudotropicalis, Candida 
quillermondii, Candida glabrata, Candida lusianiae^ or Candida rugosa. 

In yet other preferred embodiments, the subject regulatory proteins can be provided as 
recombinant fusion proteins which includes a second polypeptide portion, e.g., a second 

25 polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ 
ID Nos: 7-12, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second 
polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a 
polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay. 

Yet another aspect of the present invention concerns an immunogen comprising at 

30 least a portion of a polypeptide designated by one of SEQ ID Nos. 7-12 in an immunogenic 
preparation, the immunogen being capable of eliciting an immune response specific for the 
subject protein; e.g., a humoral response, e.g., an antibody response; e.g., a cellular response. 

A still further aspect of the present invention features an antibody preparation 
specifically reactive with an epitope of one of the subject regulatory proteins. 

35 Another aspect of the present invention provides a substantially pure nucleic acid 

having a nucleotide sequence which encodes one of the subject polypeptides. Furthermore, 
in certain preferred embodiments, the subject nucleic acids will include a transcriptional 
regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer 
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sequence, operably linked to the recombinant gene sequence, e.g., to render the recombinant 
gene sequence suitable for use as an expression vector. 

In yet a further preferred embodiment, the nucleic acid hybridizes under stringent 
conditions to a nucleic acid probe corresponding to at least 20 consecutive nucleotides of a 
5 gene designated by one of SEQ ID Nos: 1-6; more preferably it hybridizes to at least 40 
consecutive nucleotides of one of SEQ ID Nos: 1-6; and even more preferably it hybridizes 
to at least 60, 90 or 120 consecutive nucleotides of one of SEQ ID Nos: 1-6. 

In addition, the present invention makes available assays and reagents for identifying 
anti-proliferative agents, such as mitotic and meiotic inhibitors, w^hich act by inhibiting 

10 biological action of one of the subject regulatory proteins. The subject assays include those 
designed to identify agents which disrupt binding to other regulatory proteins, as well as (if 
applicable) agents which function as inhibitors of the catalytic activity of the subject protein. 
Other features and advantages of the invention will be apparent from the following 
detailed description, and from the claims. The practice of the present invention will 

I employ, unless otherwise indicated, conventional techniques of cell biology, cell culnire, 
molecular biology, microbiology, recombinant DNA, and immunology, which are within the 
skill of the an. Such techniques are explained fully in the literature. See, for example. 
Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis 
(Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover 

20 ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); MuUis et al. U.S. Patent No. 
4.683.195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); 
Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal 
Cells (R. 1. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 
1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); Methods In Enzymology 

25 (Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and 
M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 
and 155 (Wu et al eds.). Immunochemical Methods In Cell And Molecular Biology (Mayer 
and Walker, eds.. Academic Press, London, 1987); Handbook Of Experimental Immunology, 
Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986). 

30 

Description of the Drawings 

Figure 1 A demonstrates a Lineweaver-Burke analysis for recombinant Candida TYPl 
hydrolysis of para-nitrophenylphospate. 

Figure IB demonstrates a Lineweaver-Burke analysis for recombinant Candida TYPl 
35 hydrolysis of fluoroscein diphosphate. 

Detailed Description of tite Invention 
Protein phosphorylation is the most common post-translational modification that 
regulates processes inside cells and plays a key role in regulating the cell cycle engine. 
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Protein kinases add phosphates to proteins by transferring phosphate groups from, for 
example, ATP, to hydroxyl groups on amino acid side chains; protein phosphatases remove 
the phosphate group. Phosphorylation of a given amino acid in a protein can have a variety 
of effects: activating or inactivating a protein's enzymatic activity, or altering a protein's 
5 affinity for binding to other proteins. In dividing eukaryotic cells, circuits of regulatory 
kinases and phosphatases oversee both the initiation and completion of the major transitions 
of both the meiotic and mitotic cell-cycles. These regulatory networks guarantee that the 
successive events of each cell-cycle occur in a faithful and pimctual manner. Passage of a 
cells through the cell cycle is regulated at a number of key control points. For example, 

10 mitosis cannot begin imtil the cell has grown sufficiently and replicated its genome 
accurately. Likewise, cell division cannot ensue until the mitotic spindle has distributed the 
chromosomes equally to both daughter cells. 

In fission and budding yeasts, CDC2 (CDC28 in budding yeast) is the catalytic 
subunit of a protein kinase complex which is required for both DNA synthesis and for entry 

15 into mitosis. The timing and activation of the CDC2 kinase is regulated by a physical 
association with regulatory subunits called cyclins, as well as a network of protein kinases 
and phosphatases. For example, inhibitory phosphorylation of Tyr-15 and/or Thr-14 is 
mediated by the antagonistic actions of the Weel protein kinase and the CDC25 tjrosine 
phosphatase, the dephosphorylating activity of the latter resulting in activation a kinase 

20 activity of a CDC2/cyclin complex. Moreover, cyclins and cyclin dependent kinases (CDK), 
such as CDC2, are key components of the eukaryotic cell cycle in both unicellular and 
multicellular organisms, with similar allosteric control of CDKs existing amongst 
multicellular organisms. 

The present invention derives from the discovery and isolation of genes encoding 

25 novel cell-cycle regulatory proteins from the human fungal pathogen Candida, One benefit 
provided by the present invention derives from the use of the subject proteins, antibodies and 
nucleic acids as reagents for diagnositic assays. Conventional diagnosis, as indicated above, 
often involves time-consuming steps for determining the presence of infection. Such delays 
can be unacceptable where prompt treatment must be accorded in order to provide positive 

30 prognosis. The subject diagnostic assays, particularly PCR-based procedures, can provide 
diagnostically relevant information in shorter time periods. 

Furthermore, in light of the expected indespensible role of each of these proteins in 
control of cell proliferation, the present invention specifically contemplates drug screening 
assays which detect agents that disrupt the activity of one or more of the subject regulatory 

35 proteins, such as by disruption of binding to other cellular proteins or, where applicable, by 
inhibition of an enzymatic activity of the protein. Agents which inhibit the activation of 
Candida CDKs can be used as anti-fungal agents, such as to treat mycotic infections in 
animals, as preser\'atives in foodstuff, as a feed supplement for promoting weight gain in 
livestock, or in disinfectant formulations for decontaminanting equipment and rooms. 
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In panicular, we have isolated from Candida genes which encode an apparent CDC25 
phosphatase ("TYPI a pl3-l homolog ("CKSl "X a cyclin dependent kinase ("CDKl") a 
? kinase catalytic subtu^it ("MOCl"), and a Map kinase 

ktnase ("CMKr). Each of these genes, while sharing some degree of homology wi^ genes 
of other eukao^otes. are typically less than about 75 percent homologous with known genes 
and many are less than 50 percent homologous with known genes. For convenience Table I 
provides a guide to the relevant Sequence Listing entries which set forth the nucleic acid and 
ammo acid sequences for the each of the subject regulatory genes. 
Table J: Sequence Listing Guide 



30 



clone nucleic acid 



amino acid 



sequence sequenc e 

TYPI SEQIDNo.l SEQIDNo.7 

CKS SEQIDNo.2 SEQIDNo.8 

^fS^^'^^-^ SEQIDNo.9 
C YB SEQ ID No. 4 SEQ ID No. 10 

CMv l SEQ ID No. 11 

CMKl SEQ ID No. 6 SEQ ID No. 12 

annend.?', '^^'^ ^^^'^y^^ - the specification, examples, and 

appended claims are collected here. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 

rr n ^PP^^P"^^^' (RNA). The term 

should also be understood to include, as equivalents, analogs of either RNA or DNA made 

IrnT. " K " " ^° embodimem being described, single- 

stranded (such as sense or antisense) and double-stranded polynucleotides 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
compnsmg an open reading frame encoding a polypeptide of the presem invention, including 
both exon and (optionally) intron sequences. A "recombinant gene" refers to nucleic acid 
encoding such regulatory polypeptides, which may optionally include intron sequences which 
are erther denved from a chromosomal DNA. Exemplary recombinant genes encoding the 
subject regulatory proteins are represented ir. SEQ ID Nos: 1 -6. The term "intron" refers to a 
DNA sequence presem in a given gene which is not translated into protein and is generally 
found between exons. ^ 

AS used herein, the term "transfection" means the introduction of a nucleic acid 
e^g-, ^ expression vector, into a recipiem cell by nucleic acid-mediated gene transfer 
Transforrnation", as used herein, refers to a process in which a cell's genotype is changed as 
a result of the cellular uptake of exogenous DNA or RNA, and, for exampirthe transfLed 
cell expresses a recombinant form of a polypeptide of the presem invention or wher^ anti- 
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sense expression occurs from the transferred gene, the expression of a naturally-occurring 
form of the protein is disrupted. ^ 

As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transponing another nucleic acid to which it has been linked. One type of preferred vector is 
5 an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Preferred vectors 
are those capable of autonomous replication and^expression of nucleic acids to which they are 
linked. Vectors capable of directing the expression of genes to which they are operatively 
linked are referred to herein as "expression vectors". In general, expression vectors of utility 
in recombinant DNA techniques are often in the form of "plasmids" which refer to circular 

10 double stranded DNA loops which, in their vector form are not bound to the chromosome. In 
the present specification, "plasmid" and "vector" are used interchangeably as the plasmid is 
the most commonly used form of vector. However, the invention is intended to include such 
other forms of expression vectors which serve equivalent functions and which become known 
in the art subsequently hereto. 

15 "Transcriptional regulatory sequence" is a generic term used throughout the 

specification to refer to DNA sequences, such as initiation signals, enhancers, and promoters, 
which induce or control transcription of protein coding sequences with which they are 
operably linked. In preferred embodiments, transcription of a recombinant gene is imder the 
control of a promoter sequence (or other transcriptional regulatory sequence) which controls 

20 the expression of the recombinant gene in a cell-type in which expression is intended. It will 
also be understood that the recombinant gene can be imder the control of transcriptional 
regulatory sequences which are the same or which are different from those sequences which 
control transcription of the naturally-occurring form of the protein. 

"Homology" refers to sequence similarity between two peptides or between two 

25 nucleic acid molecules. Homology can be determined by comparing a position in each 
sequence which may be aligned for purposes of comparison. When a position in the 
compared sequence is occupied by the same base or amino acid, then the molecules are 
homologous at that position. A degree of homology between sequences is a function of the 
number of matching or homologous positions shared by the sequences. 

30 "Cells," "host cells" or "recombinant host cells" are terms used interchangeably 

herein. It is imderstood that such terms refer not only to the particular subject cell but to the 
progeny or potential progeny of such a cell. Because certain modifications may occur in 
succeeding generations due to either mutation or environmental influences, such progeny 
may not. in fact, be identical to the parent cell, but are still included within the scope of the 

35 term as used herein. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid sequence 
encoding one of the subject polypeptides with a second amino acid sequence defining a 
domain foreign to and not substantially homologous with any domain of the first polypeptide. 
A chimeric protein may present a foreign domain which is found (albeit in a different 



BNSDOCID: <WO 9639527A1 J_> 



wo 96/39527 

PCT/US96/08807 

- 10- 

protein, in „ organise, wluch also expresses *e firs, proKin, or i, may be an "inKr.pccies- 
e,c. teion Of protein sm.c,ures expressed by dtfferen, JL of orjaniZ^ ' 

each Of*' '™, ™* -^P^" <o nucleic acid sequences encoding 

e«h of *. regulatory pt^teins. refers to nucleic acid sequences which have arisen natural^ 
5 .n an organtsm. .ncludirig naturally occurring ntutants. TT,e tern, also refers to nJT^M 
sequences wtad, whUe derived ftotn naturally occuning genes, have been altld by 
mi^-agen^is. - for example, combinatorial mutagenesis described below, yet sU ™lc^ 
polypepudes which have a. least one activity of U,e wild-type (-authentic-) protein 

,r*e ° ""'"'^-"'^'■'•^^•-•^As.respecrtvely.tbatarepres^^^ 

n the nan^l source of the macromolecule. For example, isolated mtcleic ac ds enJnTT 
subject polypepudes preferably include no more than 10 kilobases (lb) of nucleic acM 

mo« d»„ 5kb of such naturally occurring flanking sequences, and most preferilv le "to 

1 o a-lr^'^, """1"* "^"^ ^= - '-'-^ as used he^" 

refers ,0 a nucleic acd or polypepUde that is substantially ft« of cellular material viral 
mat^tal. or culmre medium when pr^uced by recombinant DNA techniques H n.^^ 

acid ,s mea^t to tndude nucleic acid ftagments which ^ no, nanually occurring as 
20 ftagmeras and would no, be found in flie nanral state. occumng as 

As described below, one aspect of this invention pertains to an isolated nucleic acid 
c^pns„,g ,he nucleotide sequence encoding one of the subject ,.gul«or^teI 
b^tegtcally acuve ftagments thereof, and/or equivalents of such nucleic acids 

^uTcs Tntlr h ^ » '"^'"''^ nucleotide' 

a^^l^T , I"""^ equivalent to the polypeptides 

wWct f r '""^ " '° ^"•"'"■-"y '""ivalent ^Ivpeptides 

U^™? ' "^"^ addirionally ret'am other 

act,v,t,es of a TYPl protetn. e.g., dM ability ,o bind to a CDK. e.g a a>Kl 

™i "'™'°" eontemplatts nucleic acids which «Kode 

^lypepudes tha, are homologous and functionally equivalent to other oft ^ 

::"lrc~ '>P=P-ofCKS, may^amt^eT^'i;' 

n . , ' ^' and/or other regulatory nroteins well 

as cellular substrates of the authentic forni of the kinase In .^H v 

. . " "'^ Kinase. In addition, an equivalent CDK"! 

polypepude may retain its kinase activity In ^!mn«r f u- 'I^'^^'ent CDKl 

, . . , av-uviiy. in Similar fashion, an equivalent MOri 

polypeptide may be characterized bv binding to CDK 1 or »„«.k .- . '^"'''^'^"^ ' 
well a.; nr «lt.r«.f 1 u - ..' ^ ^^^^ ^^t^^^r cyclin-dependent kinase, as 
well as, or alternatively, by its kmase activity to>vards substrates of the naturally occurring 
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form of the protein. Equivalent polypeptides of the subject CYBl protein will typically 
retain the ability to bind to a CDK, e.g. CDKl . 

Moreover, it will be understood that such equivalent polypeptides as described above 
may mimic (agonize) the actions of the authentic form of one of the subject regulatory 
5 proteins. However, it is expressly provided that such equivalents include polypeptides which 
function to antagonize the normal function of the wild-type protein. For instance, dominant 
negative mutants of any of the enzymes TYPL CDKl , MOCl or CMKl may competitively 
inhibit the fimction of the authentic protein by binding to substrate without catalytically 
acting upon it. Mutants of any of the subject proteins which produce non-productive 

10 complexes with other regulatory proteins can likewise be antagonistic homologs. 
Accordingly, the term "biological activity", with respect to homologs of the proteins 
enumerated in the Sequence Listing, refers to both agonism and antagonism of the ordinary 
function of the wild-type form of that protein. 

Thus, equivalent nucleotide sequences will include sequences that differ by one or 

15 more nucleotide substitutions, additions or deletions, such as intragenus variants; and will 
also include sequences that differ from the nucleotide sequence encoding the portion of the a 
protein represented in one of SEQ ID Nos. 1-6 due to the degeneracy of the genetic code. 
Equivalent nucleic acids will also include nucleotide sequences that hybridize under stringent 
conditions (i.e., equivalent to about 20-27**C below the melting temperature (J^) of the DNA 

20 duplex formed in about IM salt) to a nucleotide sequence of a Candida gene represented in 
one of SEQ ID Nos. Nos. 1 -6. 

Preferred nucleic acids encode polypeptides comprising an amino acid sequence 
which is at least 60% homologous, more preferably 70% homologous and most preferably 
80% homologous with an amino acid sequence shown in one.of SEQ ID Nos. 7-12. Nucleic 

25 acids encoding polypeptides, particularly polypeptides retaining an activity of one of the 
subject regulatory proteins, and comprising an amino acid sequence which is at least about 
90%, more preferably at least about 95%, and most preferably at least about 98-99% 
homologous with a sequence shown in one of SEQ ID Nos. 7-12 are also within the scope of 
the invention. 

30 In yet a further embodiment, the recombinant regulatory genes may further include, in 

addition to the nucleic acid sequences shown in SEQ ID Nos. 1-6, additional nucleotide 
sequences. For instance, the recombinant gene can include nucleotide sequences of a PCR 
fragment generated by amplifying the gene from a genomic DNA library, e.g., intronic 
sequences, as well as 5' and 3* non-coding sequences of any of the subject genes. 

35 Another aspect of the invention provides nucleic acid that hybridizes under high or 

low stringency conditions to nucleic acid which encodes a polypeptide identical or 
homologous with an amino acid sequence represented in one of SEQ ID Nos. 7-12. 
Appropriate stringency conditions which promote DNA hybridization, for example, 6.0 x 
sodium chloride/sodium citrate (SSC) at about 45'*C, followed by a wash of 2.0 x SSC at 
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50°C. are known to those skilled in the art or can be found in Current Protocols m Molecular 
Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt concentration in 
the wash step can be selected from a low stringency of about 2.0 x SSC at 50=C to a high 
stnngency of about 0.2 x SSC at 50°C. In addition, the temperature in the wash step can be 
increased from low stringency conditions at room temperature, about 22°C to hiah 
stringency conditions at about 65*0. 

Isolated nucleic acids encoding a Cartdida regulatory protein of the present invention, 
yet which differ from the nucleotide sequences shown in SEQ ID Nos. 1-6 due to degeneracy 
in the genetic code, are also within the scope of the invention. Such nucleic acids are 
understood to be capable of encoding functionally equivalent polypeptides (i.e., a polypeptide 
having at least a portion of the biological activity of a protein encoded by the enumerated 
sequences). For instance, a number of amino acids are designated by more than one triplet 
Codons that specify the same amino acid (for example, CAU and CAC are svnonyms for 
histidme) may result in "silent" mutations which do not affect the amino acid sequence of the 
protem. However, it is expected that DNA sequence polymorphisms that do lead to changes 
m the ammo acid sequences of the protein will exist even within the same species One 
skiUed m the art will appreciate that these variations in one or more nucleotides (up to about 
.-40/0 of the nucleotides) of a gene encoding a protein may exist among individual cells of a 
given species, e.g., amongst a population of C. albicarv: cells, due to namral allelic variation 
Any and all such nucleotide variations and resulting amino acid polymorphisms are within 
the scope of this invention. 

Fragments of the nucleic acid encoding portions of the subject regulatory proteins, 
such as the catalytic domain of the TYPl phosphatase, are also within the scope of the 
invention. As used herein, such fragments refer to nucleotide sequences having fewer 
nucleotides than the coding sequence of Uie gene, yet still include enough of the coding 
sequence so as to encode a polypeptide with at least some of the activity of the full-length 
protein activity. 

Nucleic acids within the scope of the invention may also contain linker sequences, 
modified restriction endonuclease sites and other sequences useful for molecular cloning 
expression or purification of the recombinant polypeptides. 

As indicated by the examples set out below, a nucleic acid encoding one of the 
subject proteins may be obtained from mRNA present in the cells of a pathoeen from the 
genus Candida. It will also be possible to obtain nucleic acids encoding the subject proteins 
from genomic DNA obtained from such cells. For example, a gene encoding one of the 
pathogen regulatory proteins can be cloned from either a cDNA or a genomic library from 
other Candida species in accordance with protocols described herein, as well as those 
generally known in the art. For instance, a cDNA encoding a TYPl protein can be obtained 
by isolating total mRNA from a culmre of Candida cells, generating double stranded cDNAs 
from the total mRNA, cloning the cDNA into a suitable plasmid or bacteriophage vector, and 
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isolating clones expressing TYPl protein using any one of a number of known techniques, 
e.g., oligonucleotide probes or western blot analysis. Genes encoding related proteins can 
also be cloned using established polymerase chain reaction techniques in accordance with the 
nucleotide sequence information provided by the invention. The nucleic acid of the invention 
5 can be DNA or RNA. 

Another aspect of the invention relates to the use of the isolated nucleic acid in 
"antisense" therapy. As used herein, "antisense" therapy refers to administration or in situ 
generation of oligonucleotide probes or their derivatives which specifically hybridizes (e.g. 
binds) under cellular conditions, with the cellular mRNA and/or genomic DNA encoding one 

10 of the subject regulatory proteins so as to inhibit expression of that protein, e.g. by inhibiting 
transcription and/or translation. The binding may be by conventional base pair 
complementarity, or, for example, in the case of binding to DNA duplexes, through specific 
interactions m the major groove of the double helix. In general, "antisense" therapy refers to 
the range of techniques generally employed in the art, and includes any therapy which relies 

15 on specific binding to oligonucleotide sequences. 

An antisense construct of the present invention can be delivered, for example, as an 
expression plasmid which, when transcribed in the cell, produces RNA which is 
complementary to at least a unique portion of the cellular mRNA which encodes one of the 
regulatory proteins. Alternatively, the antisense construct is an oligonucleotide probe which 

20 is generated ex vivo and which, when introduced into the cell, causes inhibition of expression 
by hybridizing with the complementary mRNA and/or genomic sequences. In any event, it 
will be generally desirable to choose an antisense molecule which uniquely hybridizes to the 
Candida gene, e.g. does not hybridize under physiological conditions to DNA or RNA from a 
mammalian cell, especially a human cell. Such oligonucleotide probes are preferably 

25 modified oligonucleotide which are resistant to endogenous nucleases, e.g. exonucleases 
and/or endonucleases, and is therefore stable in vivo. Exemplary nucleic acid molecules for 
use as antisense oligonucleotides are phosphoramidate, phosphothioate and 
methylphosphonate analogs of DNA (see also U.S. Patents 5,176,996; 5,264,564; and 
5,256,775). Additionally, general approaches to constructing oligomers usefiil in antisense 

30 therapy have been reviewed, for example, by van der Krol et al. (1988) Biotechniques 6:958- 
976; and Stein et al. (1 988) Cancer Res 48:2659-2668. 

Accordingly, the modified oligomers of the invention are useful in therapeutic, 
diagnostic, and research contexts. In therapeutic applications, the oligomers are utilized in a 
manner appropriate for antisense therapy in general. For such therapy, the oligomers of the 

35 invention can be formulated for a variety of modes of administration, including systemic and 
topical or localized administration. Techniques and formulations generally may be found in 
Remmington's Pharmaceutical Sciences . Meade Publishing Co., Easton, PA. For systemic 
administration, injection is preferred, including intramuscular, intravenous, intraperitoneal, 
and subcutaneous for injection, the oligoniers of the invention can be formulated in liquid 
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«.lu„o„., preferably in physiologically comp^ibfe buffers such >s Hank's solution or 
Rangers solution. ,„ addition, ,„e oligonters may be fonnulated in solid fl aJJ 
—ed or suspended immediately priorto use. Lyophilized fonns are also includT 

^''"""'^ admtnisuation can also be by transmucosal or nansdennal nteans or flte 
S compounds can be administered orally. For transmucosal or transdermal adm^Stlu^ 
penc^nts apptopnate to the bamer to be pleated are used in U,e fonnulatioTTrh 
..neuants are generally Icno™ in the ar. and include, for example, for tranTmuco"* 
adrntntstration btle salts and ftsidic acid derivatives. ,„ addition, detects may~.o 
facUttate penneatton. Tt^nsmucosa, administraUon may be through lal sprays or Ig 

o iZrof 1 " r""'"' '"'^ '°>"=- -nunist^ion. the 

In addhion to use in thentpy, the oligomers of the invention may be used as diagnostic 
^.nts .o detect *e presence or absence of the targe. DNA or RNA s«n«nc« to »teh 

and apparent orfltologs of other eukao-otes, even other single cell euk^yotes the 
probe/pntner of the pr^en. invenUon »iU penni, diagnostic iavs which T lidt 
d.stu,gu.shC»:rf«„ infection from other causative agents of fungicemL ^ 
encodin?"' 'Ti,°" "'"-^ i"">«fe = ™«='eo,ide sequence 

Zet O W ^ -^"'^ » « -ne regu'ato" 

sequence. Operably hnl,ed ts intended to mean a,a, the nucleotide sequence is uJL to a 

»ty sequence in a manner which allows expression of d,e nucleotide ^ud 
Regulatory sequences are art-tecognized. Accordingly, the tern, regulated Z."" 
ntcludes pn,moters. enhance, and other expression control elements. Exemplary regZ" 

185. A^emtc Press, San Dtego, CA (1990). For instance, any of a wide ^^ety ff 
exp^sston control sequences-sequences that contto, the expression of a DNA sequenTe ^« 
openntvely hnked to ,t may be used in these vectors to express DNA sequences encoding th" 

it T 'T'^'^ °' cytomegalovirus immediate 

early promoter the lac system, the trp system, the TAC or TRC system. T7 ptomoter whose 

t-^. — 3-phospho jycerate 

oiner glycolytic enzymes, the promoters of acid phosphatase eg Pho5 the 

"'T ~ °f *e baculo^rsystl 

or Utetr vmtses, and vanous combinaUons the.«,f. „ should be understood thai the 
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design of the expression vector may depend on such factors as the choice of the host cell to 
be transformed and/or the type of protein desired to be expressed. Moreover, the vector*s 
copy number, the ability to control that copy number and the expression of any other proteins 
encoded by the vector, such as antibiotic markers, should also be considered. 
5 This invention also pertains to a host cell transfected with a recombinant gene in order 

that it may express a recombinant protem of the present invention. The host cell may be any 
prokaryotic or eukaryotic cell. For example, a TYPl protein of the present invention may be 
expressed in bacterial cells, such as £. colL insect cells, yeast, or mammalian cells. Other 
suitable host cells are known to those skilled in the art. Exemplary cells genetically 

10 engineered to produce a recombinant protein of the present invention are the 
Schizosaccharomyces cells described below. 

Another aspect of the present invention concerns recombinant forms of the subject 
Candida regulator)' proteins. The term "recombinant protein" refers to a protein of the 
present invention which is produced by recombinant DNA techniques, wherein generally 

15 DNA encoding the protein is inserted into a suitable expression vector which is in tum used 
to transform a host cell to produce the heterologous protein. Moreover, the phrase "derived 
from", with respect to a recombinant gene encoding one of the subject proteins, is meant to 
include within the meaning of "recombinant protein" those proteins having an amino acid 
sequence of the native (or "authentic") form of the pathogen protein, or an amino acid 

20 sequence similar thereto, which is generated by mutation so as to include substitutions and/or 
deletions relative to a naturally occurring form of the protein. To illustrate, recombinant 
proteins preferred by the present invention, in addition to those having an amino acid 
sequence of the native proteins, are those recombinant proteins having amino acid sequences 
which are at least 70% homologous, more preferably 80% homologous and most preferably 

25 90% homologous with an amino acid sequence shown in one of SEQ ID Nos: 7-12. A 
polypeptide which having an amino acid sequence that is at least about 95%, more preferably 
at least about 98%, and most preferably identical to one of the sequences shown in SEQ ID 
Nos: 7-12 are also within the scope of the invention. Thus, the present invention pertains to 
recombinant proteins which are derived from Candida and which have amino acid sequences 

30 evolutionarily related to a protein represented by any one of SEQ ID Nos: 7-12, wherein 
"evolutionarily related to" refers to polypeptides having amino acid sequences which have 
arisen naturally (e.g. by allelic variance or by differential splicing), as well as mutational 
variants of the regulatory proteins which are derived, for example, by combinatorial 
mutagenesis. 

35 The present invention further pertains to methods of producing the subject 

polypeptides. For example, a host cell transfected with a nucleic acid vector directing 
expression of a nucleotide sequence encoding one of the subject regulatory proteins can be 
cultured under appropriate conditions to allow expression of the polypeptide to occur. The 
polypeptide may be secreted and isolated from a mixture of cells and medium containing the 
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^o^binan, pn>,ei„^ Alternatively, the polypeptide may be retained cytoplasn,ically and the 

2 TT' T. t ^ and 

other byproducts Smtable ntedia for cell culture are well k„ov™ i„ «,e an. The polypeptide 
^ be .solated ftotn cell culture ntedium. host cells, or both using techniques J„™ i„ U>e 
^ for punfyntg proteins including ion-exchange chrontatog^phy. gel nitration 
chrontatogtaphy ultafiitraUon. elecophor«is. and inununoafflnity puriLtion 
antibodies s^tfic for the recombinant protein. ,n a prefen^ e„,hodiLnt. the reguC 

' - 

e„c„H '^'I!' " """r"' """^S ■"■»■" of*' ^-Xject proteins 

enc„d,„, all or a selected ponion of the ptotein. can be used to produce a recotibinL, fonn 
of the pn^tetn v,a mtcrobtal or eukaryotic cellular processes. Ligating the polynucleotide 

mto hosts, tther eulcatyottc (yeast, avian, insect or ntantntalian, or ptotayotic (bacterid 

Tl^'^^'^^r' " innacellular proteins. 

It ?• T P-""--. - modifications thereof 

^bee-nployedtopr^arcrcco-nbinantfcitn^ofthesubje^ 

nncrobial means or UssueH^ulntre technology in accord with «,e subject invention 

Recombinant fonns of the subject regutatoty p,«,eins can be produced by ligating the 
c^ned gene, or a portion thereof, into a vector suitable for expt^ion in ei Jp^l^ot: 

pn,.e,ns tnclude plasn„ds and other vectors. For instance, suitable vectors for the expt^on 

1^,T """"" '""■"'"^ *= '^-""^ PBR322-dcrivcd pLmids 

oL d t • "^'^"^ "S^-^-ved plasmids and pUC-derived 

plasmids for expression in piokaiyotic cells, such as £ coJi 

instance'^ *° '"■'«=<'">binant proteins in yeast For 

."Stance, pRS vectors, e.g.. pRS303. pRS304. pRS305, pRS306. etc., are cloning and 
expression vehicles usefti. in the i„u»d„ction of genetic constructs into 5. c.re.<s,ae (1 Tor 
example Silcorsici et al. (1989, Ce^.cs 122:19-27; and ChrisUanson (1992, Cr. /^„" 
12.), These vectors can replicate to £ coU due to the presence of the pBR322 od, and in 5 

TZ , " "^'^ - ^^--s. such a! 

paj/ora. IS contemplated by this invention. 

In some instances, i, may be desirable to express the recombinant genes by the use of 
a baculovn^s expression system. Examples of such baculovinis expression systems include 
pVL-denved vectors (such as pVLI392, pVLI393 and pVL941,, pAcUwieHved vectors 

m tZ Tri' t — - <he Jal containing pB 

m. P2B1UC,. Furiter, the p2Blue vector has the added feann. of being capable of expressing 



BNSDOCID: <WO_9639527A1_|_ 



wo 96/39527 



- 17- 



PCTAJS96/08807 



two exogenous proteins simultaneously (p2BIue, Invitrogen Coq5. Catalog number V-1970- 
10). 

When expression of a carboxy-terminal portion of one of the polypeptides en2yme is 
desired, i.e., a truncated form of the protein, it may be desirable to add a start codon (ATG) to 
5 the oligonucleotide fragment containing the desired sequence to be expressed. It is well 
known in the art that a methionine at the N-terminal position can be enzymatically cleaved by 
the use of the enzyme methionine aminopeptidase (MAP). MAP has been cloned from E. 
coli (Ben-Bassat et al (1987) J, Bacteriol 169:751-757) and Salmonella typhimurium and its 
in vitro activity has been demonstrated on recombinant proteins (Miller et al (1987) PNAS 

1 0 84:271 8-1722). Therefore, removal of an N-terminal methionine, if desired, can be achieved 
either in vivo by expressing recombinantly-derived polypeptides in a host which produces 
MAP (e.g., E. coli or CM89 or S, cerevisiae), or in vitro by use of purified MAP (e.g., 
procedure of Miller et al.). 

Alternatively, the coding sequences for the polypeptide can be incorporated as a part 

15 o! a fusion gene so as to be covalently linked in-frame with a second nucleotide sequence 
encoding a different polypeptide. This type of expression system can be useful, for instance, 
where it is desirable to produce an inununogenic fragment of the protein. For example, the 
VP6 capsid protein of rotavirus can be used as an immunologic carrier protein for portions of 
the T^Tl polypeptide, either in the monomeric form or in the form of a viral particle. The 

20 nucleic acid sequences corresponding to the portion of the protein to which antibodies are to 
be raised can be incorporated into a fusion gene construct which includes coding sequences 
for a late vaccinia virus structural protein to produce a set of recombinant viruses expressing 
fusion proteins comprising a portion of the TYPl protein as part of the virion. It has been 
demonstrated with the use of immunogenic fusion proteins utilizing the Hepatitis B surface 

25 antigen fusion proteins that recombinant Hepatitis B virions C£in be utilized in this role as 
well. Similarly, chimeric constructs coding for fusion proteins containing a portion of a 
TYPl protein and the poliovirus capsid protein can be created to enhance immunogenicity of 
the set of polypeptide antigens (see, for example, EP Publication No. 0259149; and Evans et 
al. (1989) Nature 339:385; Huang et oL (1988) J. Virol 62:3855; and Schlienger et al 

30 (1992)y. F/ro/. 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can also be 
utilized, wherein a desired portion of a one of the subject proteins is obtained directly from 
organo-chemical synthesis of the peptide onto an oligomeric branching lysine core (see, for 
example, Posnett et al (1988) JBC 263:1719 and Nardelli et al (1992) J, Immunol 148:914). 

35 Antigenic determinants of the subject proteins can also be expressed and presented by 
bacterial cells. 

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 
appreciated that fusion proteins can also facilitate the expression of proteins. For example, 
recombinant forms of each of the subject pathogen proteins can be generated as glutathione- 
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S-,ra„sfera.. (GST) fusion p„,ei„s. Such OST fusion p,o,ci„s be used ,o simpUfy 
pu„flca„o„ of U,e pr<«ein. such as through .he use of g,u,aU,ione-derivad.ed „a,rices fs« 

5 1!° . ? ™bodi„,en,. a fusion gene coding for a puriflcation lead^ 

" Ceavage si,e sequence a, U,e N-.ennin„s of .he 

des„ed pomon of *e ^combinan, pr„,ei„, can faci,i,a,e pudficafion of U,e fusion p^ein by 
affin,^ chrc.na.ograpl,y using a Ni2* n,e,al resin. The purification leader sequence can ften 
be subscuenriy removed by ueamen, „^J. „ 

10 Techniques for making fusion genes are »ell known. Essentially ,he ioinin^ of 

var-ous DNA .agmenfs coding for differen. polypeptide sequences l ^^21 ll 
ccordancc wtd, conventiona. tech„iq.„s. employing blunt-e^ied or staggerLed terlL 
for hgauon restnctton enzyme digestion to provide for appropriate te^ni. miinilTf 
^he ,ve ends a. appropriate, alkaline phosphatase treatment to avoid undesirable joilg 

15 and enzymauc hgatton. ,„ another embodiment, the fusion gene can be sy«hes ZZ' 
—ona. techniques incMng automated DNA synd,eLers. A terl^e " pCR 
ampMcat.cn of gene fiagmen. can be carried ou. using anchor primers which g^e'ril 

!0 M>fec„/=r Aotog,, eds. Ausabel « »/. John Wiley i Sons- 1992) 

*e su jec. fungal proteins, which are isolated from, or othet,vise substantially free 7^ J 
muacellular prcems which may be normally associa.ed. especially o.her cW 

:::L:nsr.rrg7essTro^yr:i;.>:^^^^ — ■ 

com^s less than S.eo„.amina.ng^p:~r^^^^^^ 
Zl ""^tZ r ™"'- ^ •^■Cloned gL as dLTrib^ 

0 m:™™ r^ge ZT^Z — hy dty weight. 

wei.h; r.rwJ ■ r ^ """^ preferably a. least 99.8V. by 

s3tmo elLl " "'""^ buffers, and oth^ 

(e.g., in an aciylamide gel) but not obtained either a<; nnr^ ^ v ■ ■ 

proteins, or chromatography reagents such ^dJ t 

, J reagents such as denaturing agents and oolvmeri eo 

acrylamide or agarose) substances or solutions. s g s ana polymers, e.g. 
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However, the subject polypeptides can also be provided in pharmaceutically 
acceptable carriers for formulated for a variety of modes of administration, including 
systemic and topical or localized administration. Techniques and formulations generally may 
be found in Remmington's Pharmaceutical Sciences, Meade Publishing Co., Easton, PA. In 
5 an exemplary embodiment, a dominant negative mutant of one of the subject regulatory 
proteins can be provided for transmucosal or transdermal delivery. For such administration, 
penetrants appropriate to the barrier to be permeated are used in the formulation with the 
polypeptide. Such penetrants are generally known in the art, and include, for example, for 
transmucosal administration bile salts and fusidic acid derivatives. In addition, detergents 

10 may be used to facilitate permeation. Transmucosal administration may be through nasal 
sprays or using suppositories. For topical administration, the oligomers of the invention are 
formulated into ointments, salves, gels, or creams as generally known in the art. 

Another aspect of the invention related to polypeptides derived from the full-length 
fomis of the subject proteins. Isolated peptidyl portions can be obtained by screening 

15 : polypeptides recombinantly produced from the corresponding fragment of the nucleic acid 
encoding such polypeptides. In addition, fragments can be chemically synthesized using 
techniques known in the art such as conventional Merrifield solid phase f-Moc or t-Boc 
chemistry. For example, TYPl can be arbitrarily divided into fragments of desired length 
with no overlap of the fragments, or preferably divided into overlapping fragments of a 

20 desired length. The fragments can be produced (recombinantly or by chemical synthesis) and 
tested to identify those p)eptidyl fragments which can function as either agonists or 
antagonists of, for example, CDK activation, such as by microinjection assays. In an 
illustrative embodiment, peptidyl portions of the Candida TYPl can be tested for CDK- 
binding activity, as well as inhibitory ability, by expression as, for example, thioredoxin 

25 fusion proteins, each of which contains a discrete fragment of the TYPl protein (see,* for 
example, U.S. Patents 5,270, 1 8 1 and 5,292,646; and PCT publication W094/ 02502). 

It is also possible to modify the structure of the subject regulatory proteins for such 
purposes as enhancing therapeutic or prophylactic efficacy, or stability (e.g., ex vivo shelf life 
and resistance to proteolytic degradation in vivo). Such modified polypeptides, when 

30 designed to retain at least one activity of the naturally-occurring form of the protein, are 
considered functional equivalents of the polypeptides described in more detail herein. Such 
modified polypeptides can be produced, for instance, by amino acid substitution, deletion, or 
addition. 

Moreover, it is reasonable to expect that an isolated replacement of a leucine with 2in 
35 isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar 
replacement of an amino acid with a structurally related eunino acid (i.e., conservative 
mutations) will not have a major effect on the biological activity of the resulting molecule. 
Conservative replacements are those that take place within a family of amino acids that are 
related in their side chains. Genetically encoded amino acids are can be divided into four 
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families: (1) acidic - aspamic, glutamate; (2) W - lysine, a-^i^e. hisddi,«; (3) nonpolar 
- alan.„e val,ne. leucine, isoleucine, proline, phenylalanine, meWorine. ttyptoptan; and (4) 
uncharged polar = glycine, asparagine, gluiamine, cysteine, serine, ftreonine. tyrosine. 
^e„ylalan.„e „yp,„phan. and tyrosine are somcimes classified joimly as a,.n>a,ic anuno 
acids. In s,™l^ festoon. U,e amino acid repenoire can be grouped as (I) acidic = aspar,a>e. 

leucine, .soleuc ne senne, teonine, »iu, serine and d^eonine optionally be grouped 
« as altphaUc-hydroxyl; (4) a^n^tic = phenylalanine, tyrosine, 4topL: (5) 
™.d= - a^aragtne, glu.am„,e; and (6) sul&r containing - cysteine and ntethi J„e (sL 
for example aocW^ro-, 2nd ed, Ed. by L. Str^cr, WH Freeman and Co.:1981). Whether a 
Change „ the amino acid sequence of a peptide results in a funcdonal homolog of one of flte 
2- Protetns can be readily detennined by assessing the ability of the variant peptide to 

homolog .„ a^ system. Polypeptides in which more than one replacement has .Len 

15 'an readUy be tested in the same manner. 

mu,an,s'^r,.'"T''" ' -"^-^ of generaUng sets of combinatorial 

InTMn, t ? T " " «P--«y "-fl" for 

omb,„a,o„al hbranes .s, for example, to isolate homologs from the library which fiuKtion in 
leT " °" < " ~ -ti""" of 

whl 1 T r ''^ 'o Provide homologs 

whtch lack phosphatase acttvty yet still retain Ute ability ,o bind to a CDK. e.g a CDKl 

'5 ^^^"n V'" " f™''"^ -™ of the 

XT™ r ~ '"^-f"" "o"'- -native 

Ph^types of .he subject pathogen TYPl enzyme, and can be used in, for example, gene 
therapy protocols that target delivery of a rerArr.K,-«a„t „ . ' e <= 

TYPl mutant to a paUtogen. -oombtnant gene encodtng a dominant negative 

30 , K ' ■fYP' M»"^ « P««iuced by way of a degenerate 

r: f fl" '"'"""^ " "''^ of polypeptides which each include at least a p/nionTf 
P^ta, TYPl nucleotide sequences. A mix^re of synthetic oHgonucleotid^ can ^ 
«utymat,cally hgated mto gene sequences such that the degenerate se, of TYPl nucleotide 
sequences are exi^sible as individual polypeptides, or altentatively, as a set of larger fusion 

35 '■'■^■■'Z '^' *^'"^>'> «»= TYPl sequence library therei„: ,„ an 

35 tllustiauve embodtment. the library of TYP, phosphata^ mutants is expressed in the S 

po^ c,c:,.,2. strain described below. C„.xpr«sion of the wild-type TYP, 

e* a recomb.nan.ly pr«iuced TYPl from CanJUa. wid, a member of the TYPl variant 

hbnuy, m conjunction wid, detecting proliferation of ti« cells, will pennh d,e identiflcation 
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of dominant negative TYPl mutants which are able to rescue the otherwise hyper-mitotic 
cell. 

There are many ways by which the library of TYPl homologs can be generated from 
a degenerate oligonucleotide sequence. For instance, chemical synthesis of a degenerate 
5 gene sequence can be carried out in an automated DNA synthesizer, and the synthetic genes 
then ligated into an appropriate gene for expression. The purpose of a degenerate set of 
TYPl oligonucleotide sequences is to provide, in one mixture, all of the sequences encoding 
the desired set of potential TYPl sequences. The synthesis of degenerate oligonucleotides is 
well known in the art (see, for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al 

10 (1981) Recombinant DNA, Proc 3rd Cleveland Sympos, Macromolecules, ed. AG Walton, 
Amsterdam: Elsevier polypepiide273-289; Itakura et al. il9S4) Annu. Rev. Biochem. 53:323; 
Itakura et al (1984) Science 198:1056; Ike et al (1983) Nucleic Acid Res, 11:477). Such 
techniques have been employed in the directed evolution of other proteins (see, for example, 
Scott al (1990) Science 249:386-390; Roberts et al (1992) PNAS 89:2429-2433; Devlin et 

15 al (1990) Science 249: 404^06; Cwirla et al (1990) PNAS 87: 6378-6382; as well as U.S. 
Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 

Moreover, there are several forms of mutagenesis generally applicable, in addition to 
a general combinatorial mutagenesis approach. For example, homologs of the subject 
proteins (both agonist and antagonist forms) can be generated and screened using, for 

20 example, alanine scanning mutagenesis and the like (Ruf et al (1994) Biochemistry 33:1565- 
1572; Wang et al {1994) J Biol Chem 269:3095t3099; Balint et al (1993) Gene 137:109- 
118; Grbdberg et al (1993) Eur J Biochem 218:597-601; Nagashima et al (1993) J Biol 
Chem 268:2888-2892; Lowman et al (1991) Biochemistry 30:10832-10838; and 
Cunningham et al (1989) ^c/ewc^ 244:1081-1085), by linker scanning mutagenesis (Gustin 

25 et al (1993) Virology 193:653-660; Brown et al (1992) Mol Cell Biol 12:2644-2652; 
McKnight et al (1982) Science 232:316); or by saturation mutagenesis (Meyers et al (1986) 
Science 232:613). Such techniques will be generally understood to provides for reduction of 
the subject regulatory proteins to generate mimetics, e.g. peptide or non-peptide agents, 
which are able to disrupt binding of a naturally-occurring form of a protein of the present 

30 invention with other cell-cycle regulatory proteins of the pathogen from which it was derived, 
e.g. disrupts the binding of the pathogen TYPl to a CDK. 

Thus, such mutagenic techniques as described above are particularly useful to map the 
determinants of the subject proteins which participate in protein-protein interactions. To 
illustrate, the critical residues of a TYPl protein which are involved in molecular recognition 

35 of a cyciin-dependent kinase, such as CDKK can be determined and used to generate TYPl- 
derived peptidomimetics which competitively inhibit binding of the phosphatase with the 
CDK (see, for example, "Peptide inhibitors of human papillomavirus protein binding to 
retinoblastoma gene protein" European patent applications EP-4 12,762 A and EP-B3L080A). 
By employing, for example, scarming mutagenesis to map the amino acid residues of one of 
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the subject TYPl involved in binding E6, peptidomimetic compounds (e.g. diazepine or 
isoqmnolme derivatives) can be generated which mimic those residues in binding to CDK 
and which therefore can inhibit binding of authentic TYPl to CDK and thereby interfere with 
the funaion of TYPl and/or the Kinase in proliferation of the pathogen. For instance non- 
hydrolyzable peptide analogs of such residues can be generated using benzodiazepine' (eg 
see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 
Publisher: Leiden. Netherlands, 1988), azepine (e.g.. see Huffman et al. in Peptides- 
Chemistry and Biology, G.R. Marshall ed.. ESCOM Publisher: Leiden. Netherlands 1988) 
substituted y-lactam rings (Garvey et al. in Peptides: Chemistry arui Biology, G.R. Marshall' 
ed., ESCOM Publisher: Leiden. Netherlands, 1988), keto-methylene pseudopeptides 
(Ewenson et al. (19U) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and 
Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co 
Rockland, IL, 1985), p-ium dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26-647- and 
Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and P-aminoalcohoIs (Gordon et al 
(1985) Btochem Biophys Res Commun 126:419; and Dami et al. (1986) Biochem Biophys Res 
Commun 134:71). In similar fashion, mimetics can be designed which bind to any of the 
other subject regulatory proteins, or mimic their binding to other proteins. 

Another aspect of the invention pertains to antibodies and antibody preparations 
specifically reactive with at least one of the subject proteins. For example, by using peptides 
based on the cDNA sequence of one of the proteins represented in SEQ ID Nos 7-12 anti- 
protem/anti-peptide antisera or monoclonal antibodies can be made using standard meihods. 
A manunal such as a mouse, a hamster or rabbit, can be immunized with an immunogenic 
form of the peptide. Techniques for conferring immunogenicity on a protein or peptide 
include conjugation to carriers or other techniques well known in the art. An immunogenic 
form of the protein can be administered in the presence of adjuvant. The progress of 
immumzation can be monitored by detection of antibody titers in plasma or serum. Standard 
ELISA or other immunoassays can be used with the immunogen as antigen to assess the 
levels of antibodies. In a preferred embodiment, the subject antibodies are immunospecific 
for antigenic determinants of one of the pathogen-derived proteins of the present invention, 
e.g. antigenic determinants of a protein represented by one of SEQ ID Nos. 7-12 or a closely 
related homolog (e.g. 90 percent homologous, more preferably at least 95 percent 
homologous). In yet a firnher preferred embodiment of the present invention, antibodies do 
not substantially cross react (i.e. do not react specifically) with a protein which is: e.g. less 
than 90 percent homologous, more preferably less than 95 percent homologous, and most 
preferably less than 98-99 percem homologous with one of SEQ ID Nos 7-P By "not 
substanually cross react", it is meant that the antibody has a binding affinity for a 
nonhomologous protein, particularly orthologous proteins from mammalian cells, which is at 
least one order of magnitude, more preferably at least two orders of magnitude, and even 
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niore preferably at least three orders of magnitude less than the binding affinity of that 
antibody for one of the proteins of SEQ ID Nos. 7-12. 

An effective amount of a conjugate-containing composition is introduced into a host 
animal such as a goat, rabbit, mouse, rat, horse or the like to induce the production (secretion) 
5 of antibodies to the polypeptide. Effective amounts of inmiunogens useful for inducing 
antibody secretions in host animals are well known in the art. Methods of introduction into 
the host animal are also well known and are typically carried out by parental administration 
as by injection. A plxuality of such introductions is normally utilized so that the host is 
hyperimmimized to the immunogenic polypeptide-containing conjugate. For example, 

10 weekly introductions over a one-to-two-month time period can be utilized until a desired 
anti-polypeptide antibody titer is achieved. 

Following immunization antisera can be obtained and, if desired* polyclonal 
antibodies isolated from the senmi. To produce monoclonal antibodies, antibody producing 
cells (lymphocytes) can be harvested from an immunized animal and fused by standard 

15 somatic cell fusion procedures with immortalizing cells such as myeloma cells to yield 
hybridoma cells. Such techniques are well known in the art, and include, for example, the 
hybridoma technique (originally developed by Kohler and Milstein, (1975) Nature, 256: 495- 
497), the human B cell hybridoma technique (Kozbar et al, (1983) Immunology Today^ 4: 
72), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et 

20 fl/., (1985) Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. polypeptide. 77- 
96). Hybridoma cells can be screened inununochemically for production of antibodies 
specifically reactive with the immunogen and monoclonal antibodies isolated from a culture 
comprising such hybridoma cells. 

The term antibody as used herein is intended to include fragments thereof which are 

25 also specifically reactive with one of the subject proteins. Antibodies can be fragmented 
using conventional techniques and the fragments screened for utility in the same manner as 
described above for whole antibodies. For example, F(ab')2 fragments can be generated by 
treating a full antibody with pepsin. The resulting F(ab*)2 fragment can be treated to reduce 
disulfide bridges to produce Fab' fragments. The antibody of the present invention is further 

30 intended to include bispecific and chimeric molecules. 

An antibody preparation of this invention prepared from a polypeptide as described 
above can be in dry form as obtained by lyophilization. However, the antibodies are 
normally used and supplied in an aqueous liquid composition in serum or a suitable buffer 
such as PBS. 

35 Both monoclonal and polyclonal antibodies (Ab) directed against one of the subject 

regulatory proteins, and antibody fragments such as Fab' and F(ab')2, can be used to block the 
action of that protein and allow the study of its role in the cell-cycle or in cell proliferation. 
Moreover, such antibodies can also be used diagnostically to detect an infection involving 
Candida. 
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Moreover, the nucleotide sequence determined from the cloning of the subject 
regulatory proteins will permit the generation of probes designed for use in identifying the 
presence of a Candida infection such as an infection involving C.albicans. For instance the 
present invention provides a probe/primer comprising a substantially purified 
ohgonucleotide, wherein the oligonucleotide comprises a region of nucleotide sequence 
which hybndizes under stringent conditions to at least 10, more preferably 25 50 or 100 
consecutive nucleotides of sense or anti-sense sequence of one of SEQ ID Nos ' 1-6 or 
naturally occurring mutants thereof. In preferred embodiments, the probe/primer fimher 
compnses a label group attached thereto and able to be detected, e.g. the label group is 
selected from the group consisting of radioisotopes, fluorescent compounds, enzymes and 
enzyme co-factors. Such probes can be used as a part of a diagnostic test kit for identifying 
and phenotyping particular mycotic infections, such as in a sample of cells from a patient, or 
m a foodstuff, or on equipment. 

The present invention also provides assays and reagents for identifying anti-fungal 
and anti-parasitic agents, e.g. agents which act to inhibit proliferation of a pathogen by 
altenng the activity of one or more of the subject pathogen proteins. To illustrate, inhibitors 
of the Candida TYPl phosphatase can be used in the treatment of candidiasis- an 
op^rtunistic infection tiiat commonly occurs in debilitated and immunosuppressed patients 
TYPl inhibitors could be used to treat these infections in patients with leukemias and 
lymphomas, ,n people who are receiving immunosuppressive therapy, and in patients with 
such predisposing factors as diabetes mellinis or AIDS, where fungal infections are a 
particular problem. TYPl inhibitors can be generated for treatment of mycotic infections 
caused by, for example, Candida albicans, Candida stellatoidea, Candida tropicalis, 
Candida parapsilosis, Candida krusei, Candida pseudcropicalis. Candida guillermondii 
Candida glabrata. Candida lusianiae, or Candida rugosa. Anti-proliferative agents 
developed witii the subject assays can also be used, for example, as preser^.tives in 
foodstuff, as a feed supplement for promoting weight gain in livestock, or in disinfectant 
formulations for treatment of non-living .matter, e.g.. for decominating hospital equipment 
and rooms. Furthermore, as a result of tiie considerable divergence between TYPl proteins 
it IS hkely tiiat differential screening assays, e.g. side-by-side comparison of inhibition of 
human TYPl relative to one of tiie Candida TYPl enzyme, can be used to identify aeents 
that exhibit specific inhibitoiy effects directed at tiie form of the subject TYPl protein 
present m tiie pathogen, without substantially inhibiting a CDC25 phosphatase in human or 
other animal cells. Thus, by making available purified and recombinant proteins, tiie present 
invention facilitates the development of assays which can be used to screen for dmes which 
are eitfier agonists or antagonists of the nomial cellular fimction of the subject regulatory 
proteins. An inhibitor, as identified in tiie subject assays, is an agent which is able to cause a 
statistically significant decrease in one or more proliferative activities of a regulatory protein 
of the present invention. 
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In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of compounds 
surveyed in a given period of time. Assays which are performed in cell-free systems, such as 
may be derived vAxh purified or semi-purified proteins, are often preferred as "primary" 
5 screens in that they can be generated to permit rapid development and relatively easy 
detection of an alteration in a molecular target which is mediated by a test compound. 
Moreover, the effects of cellular toxicity and/or bioavailability of the test compound can be 
generally ignored in the in vitro system, the assay instead being focused primarily on the 
effect of the drug on the molecular target, as may be manifest in an alteration of binding 

10 . affinity between one of the subject proteins and other proteins with which they interact in 
changes in enzymatic activity of one of the subject proteins, or in changes in a property of the 
molecular target manifest from binding to one of the regulatory proteins. 

Accordingly, in an exemplary screening assay of the present invention, the compound 
of interest is contacted with an isolated and purified TYP 1 polypeptide which is ordinarily 

15 capable of binding a cyclin-dependent kinase. To the mixture of the compound and TYPI 
polypeptide is then added a composition containing a CDK polypeptide. Detection and 
quantification of CDK/rYPl complexes provides a means for determining the compound's 
efficacy at inhibiting (or potentiating) complex formation between the CDK and TYPI 
polypeptides. The efficacy of the compound can be assessed by generating dose response 

20 curves from data obtained using various concentrations of the test compoimd. Moreover, a 
control assay can also be performed to provide a baseline for comparison. In the control 
assay, an isolated and purified CDK is added to a composition containing the TYPI protein, 
and the formation of CDK/TYPl complexes is quantitated in the absence of the test 
compound. Efficacy of an agent is based on producing a statistically significant change in 

25 formation of such complexes relative to the control. It will be imdcrstood that, in general, the 
order in which the reactants may be admixed can be varied, and can be admixed 
simultaneously. 

Complex formation between the TYPI polypeptide and CDK polypeptide may be 
detected by a variety of techniques. For instance, modulation of the formation of complexes 

30 can be quantitated using, for example, detectably labeled proteins such as radiolabelled (e.g. 
32p^ 35s, ^"^C or ^H), fluorescently labeled (e.g. FITC), or enzymatically labeled 
polypeptides, by immunoassay, or by chromatographic detection. The use of enzymatically 
labeled TYPI or CDK proteins v^dll, of course, generally be used only when enzymatically 
inactive portions of those proteins are used, as each protein can possess a meeisurable intrinsic 

35 activity which can be detected. 

Typically, it will be desirable to immobilize one of the two polypeptides to facilitate 
separation of complexed from uncomplexed forms of one or both of the proteins, as well as to 
acconunodate automation of the assay. Binding of the CDK to TYPI, in the presence and 
absence of a candidate agent, can be accomplished in any vessel suitable for containing the 
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reactants. Examples include microtitre plates, test tubes, and micro-centrifuge tubes In one 
embod,.ent, a fusion protein can be provided wh,ch adds a dotnain that allows the pn,LinT 
be bound to a matnx. For example, glutathione-S-transferase^, (GST/TYPi™ 
protetns can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Lo s MO 
> Slutathtone denvattzed mictotitre plates, which are then combined with the CDK 

polypepude e.g. an ^5s.,abeled CDK polypeptide, and the test compound, and the mi^e 

for salt and pH, though shghtly more stringent conditions may be desired, e g at 4°C in a 
buffer contammg 0.6M NaCl or a detergent such as O.m Triton X 100 Folio J 
:ncubat.n, the beads are washed to remove any unbound CDK p^^epUd ,td le Zx 
—zed radiolabel determined directly (e.g. beads placed^ scltill^ ^ He 

the sublet"'' fI ~ -^'^'^^ - in 

the subject assay. For mstance. either of the proteins can be immobilized utilizing 

conjugation of biotin and streptavtdin For imt«r... k- *• , ""^'^'^^B 

instance, biotmylated TYPl molecule«5 can h^. 
prepared from biotin-NHS (N-hydroxv-succmimiH»S • \ . ^oiecuies can be 

„ K . , . . "yoroxy-succmimide) usmg techniques well known in the art 

(e.g., biounylation kit. Pierce Chemicals RnrVfx^ tt n ^ . '"•"wn in me an 

. J- , '-nemicais, Kockford, IL), and immobilized in the weUs of 

streptavidin-coated 96 well plates (Pierce Chem.V^n au • . " ™ weus ot 

the TYPl but wh.Vh H .1 Chemical). Alternatively, antibodies reactive with 

ine i iri out which do not interfere with rrsir k;^^- , « . 

Diau and .h, TVPi , T ^ri^iai "o ll>e wells of the 

Place, and Ac TYPl polypepude trapped in the well, by antibody conjugation As above 
P^patauona of a CDK polypepUde and a tes, con,p„u„d are incubated in Z l^i 

^Z'^ZT """k ~ '-^^^ 

above for the OST-mrntobiltzed complexes, include immunodetecUon of complexes usine 
annbodres reacuve wid, the CDK polypeptide, or which are reactive with the^Pl n™ Z 
and compete for binding with U,e CDK polypeptide; as weU as el^rhl aZ s 

T """^ <='«">i'=^'y conjugated or 

prov ded as a fusron protcm with a CDK polypeptide. To illustrate, a CDKl ^lypL^ide cT 

oi \.,UK.i trapped m the complex with TYPi e-^r, k» j • , 

.K^ i YFl can be assessed with a chromogenic substrate of 

U.Zrr:'^- ^•'■■-'-^"■"e .erahydrochloHde or 4l,oro-l 

plTed'a^dtZlT" ^ Siutathione-S-transfcrase c^ be 

provided, and complex formauon quantilated bv detectino tu^ rc-r 

2.4.dinitroben«„e(Habig e. al (1,74,.5 Jc^Llr,^^ 
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As alluded to above, intrinsic enzymatic activities can be relied upon to detect the 
efficacy of an agent against TYPl. The detection of tHe TYPl phosphatase activity is 
described in more detail below. However, the downstream targets of TYPl, such as a CDK, 
may also have an intrinsic activity which can be utilized to quantitate the interaction with 
5 TYPl. In an exemplary embodiment, an enzymatically active TYPl is contacted with a 
phosphorylated CDK/cycIin complex, e.g. CDKl/CYBl, under conditions wherein, absent an 
inhibitor of the TYPl, that enzyme would dephosphorylate and activate the CDK/cyclin 
complex. Activation' could be detected by conversion of a substrate for the kinase complex, 
such as phosphorylation of a histone HI protein with ^^p.iabeled phosphate. 

10 For processes which rely on immunodetection for quantitating one of the proteins 

trapped in the complex, antibodies against the protein, such as either anti-CDK or anti-TYPl 
antibodies, can be used. Alternatively, the protein to be detected in the complex can be 
''epitope tagged" in the form of a fusion protein which includes, in addition to the CDK 
polypeptide or TYPl sequence, a second polypeptide for which antibodies are readily 

15 available (e.g. from commercial sources). For instance, the GST fusion proteins described 
above can also be used for quantification of binding using antibodies against the GST moiety. 
Other useful epitope lags include myc-epitopes (e.g., see Ellison et al. (1991) y Biol Chem 
266:21150-21157) which includes a 10-residue sequence from c-myc, as well as the pFLAG 
system (International Biotechnologies, Inc.) or the pEZZ-protein A system (Pharamacia, NJ). 

20 Moreover, the subject polypeptides can be used to generate an interaction trap assay, 

as described in the examples below (see also, U.S. Patent No. 5,283,317; Zervos et al. (1993) 
Cell 72:223-232; Madura et al. (1993) J Biol Chem 268:12046-12054; Bartel et al. (1993) 
Biotechniques 14:920-924; and Iwabuchi et al. (1993) Oncogene 8:1693-1696), for 
subsequently detecting agents which disrupt binding of TYPl to a CDK or other cell-cycle 

25 regulatory protein, such as a cyclin. 

The interaction trap assay relies on reconstituting in vivo a functional transcriptional 
activator protein from two separate fusion proteins, one of which comprises the DNA-binding 
domain of a transcriptional activator fused to a binding partner of TYPl, such as a CDK. The 
second fusion protein comprises a transcriptional activation domain (e.g. able to initiate RNA 

30 polymerase transcription) fxised to the TYPl polypeptide. When the CDK and TYPl 
domains of each fusion protein interact, the two domains of the transcriptional activator 
protein are brought into sufficient proximity as to cause transcription of a reporter gene. By 
detecting the level of transcription of the reporter, the ability of a test agent to inhibit (or 
potentiate) the interaction can be evaluated. Commercial kits for generating interaction traps 

35 are presently available (e.g., MATCHMAKER Kit, Clontech catalog No. kl 605-1, Palo Alto) 
and, in light of the present disclosure, can be modified for use as drug screening assays. 

In an illustrative embodiment, Saccharomyces cerevisiae YPB2 cells are transfonned 
simultaneously with a plasmid encoding a GAL4db-CDK fusion and with a plasmid encoding 
the GAL4ad domain fused to a the Candida TYPl. Moreover, the strain is transfomied such 
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that the GAL4-responsive promoter drives expression of a phenotypic marker. For example, 
the abihty to grow in the absence of histidine can depends on the expression of the HIS3 
gene. When the HIS3 gene is placed under the control of a GAL4-responsive promoter, relief 
of this auxotrophic phenotype indicates that a functional GAL4 activator has been 
reconstimted through the interaction of the CDK and the TYPl proteins. Thus, a test agent 
able to inhibit this interaction will result in yeast cells unable to growth in the absence of 
histidme. Alternatively, the phenotypic marker (e.g. instead of the HIS3 gene) can be one 
which provides a negative selection (e.g., are cytotoxic) when expressed such that agents 
which disrupt CDKmri interactions confer positive growth selection to the cells. 

It will be apparent that, in similar fashion to the exemplary TYPl-derived assays 
each of the other Candida regulatory proteins can be used to generate equivalent drug 
screemng assays which provide a protein-protein interaction as the inhibitory target For 
example, each of the CYBl, MOCl and CKSl proteins can be used to generate assays for 
detectmg agents which inhibit interaction with a CDK., such as CDKl. 

Moreover, for each of the subject regulatory proteins which have intrinsic enzvmatic 
activities, such as the TYPl. CDKI, MOCl and CMKl proteins, the present invention 
provides methods and reagents for identifying agents which inhibit the enzymatic activity of 
the protem, e.g. agents which are mechanism based inhibitors of the enzyme, rather than 
merely disrupting the formation of a protein complex. Inhibitors of the enzymatic activity 
can be identified, for example, using assays generated for measuring the ability of an agent to 
inhibit catalytic conversion of a substrate by one of the subject enzymes. Again using TYPl 
as an illustrative embodiment, a molecule or compound (e.g. a "test agent") to be assessed for 
Its ability to inhibit the phosphatase activity of the subject TYPl enzyme is combined with 
the enzyme and a substrate of its phosphatase activity. The resulting combination is 
maintained under conditions appropriate for the TYPl enzyme to act upon the substrate The 
conversion of the substrate to product by tiie subject TYPl enzyme is assessed, and the result 
compared to the rate or level of conversion of tiie substrate in the absence of the test agent. A 
statistically significant decrease in the activity of tiie TYPl phosphatase in the presence of 
the test agent, manifest as a decrease in conversion of substrate to product, indicates that the 
30 test agent is an inhibitor of the pathogen TYPl . 

In preferred embodiments, the substrate of the TYPl t>'rosine phosphatase is a 
synthetic substrate, e.g. a peptide or tyrosine analog, comprising a colorimetric or fluorescent 
label which IS detectable when the substrate is catalyticaliy acted upon by tiie TYPl. As used 
herein "colonmetric" refers to substrates detectable by change in absorption or fluorescent 
charactenstics. For instance, prefeired synthetic substrates include p-nitrophenylphosphate 
(pNPP), fluorosceindiphosphate (FDP), 3-O-methylfluoroscein phosphate (3-MFP). Other 
chromogenic substrates include 3-(p.hydroxyphenyl) propionic acid (HPPA), 2-Naphthyl 
phosphate, pyridoxal phosphate, adamantyl 1 ,2-doxetance phosphate, disodium 3-(4- 
methoxyspirol {I,2.dioxetane-3.2'-(5'-chloio) tricyclo [3.3.1.1] decan}-4-yl) phenyl 
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phosphate, Thymolphtalein monophosphate, 3-indoxyl phosphate and the like. Yet other 
substrates include radiolabeled peptides, such as peptides containing ^^p-iabeled 
phosphotyrosines, e.g, tyrosine phosphorylated forms of reduced carboxamindomethylated, 
maleyated lyzosyme (RCML) or CDC-derived peptides, wherein release of the radiolabel can 
5 be detected and correlated with TYPl enzymatic activity. 

In an illustrative embodiment, the method comprises the steps of: (a) combining a 
compound to be assessed, the subject Candida TYPl (purified or semipurified), and a 
synthetic substrate of the pathogen TYPl tyrosine phosphatase comprising a colorimetric label 
which is detectable when the substrate is acted upon by the TYPl (e.g., p- 

10 nitrophenylphosphate); (b) maintaining the substrate/enzyme/test compound combination 
under conditions appropriate for the pathogen-derived TYPl to act upon the substrate; and (c) 
determining, by colorimetric assay, the extent to which the TYPl enzyme present in the 
combination acted upon the substrate, relative to a control, the control comprising the TYPl 
and the substrate. If the subject TYPl enzyme acts upon the substrate to a lesser extent than 

15 m the control, the compound is an inhibitor of the pathogen TYPl tyrosine phosphatase 
aciivirv. 

In yet another embodiment of the present invention, inhibitors of the subject 
regulaior>' proteins which are involved in positive growth regulations are identified through 
their ability to rescue an otherwise hyper-mitotic cell from mitotic catastrophe, e.g. such as 

20 described in U.S. Patent Application 08/073,383. The term hyper-mitotic cell denotes a cell 
having an impaired cell-cycle regulatory protein which can cause premature progression of 
the cell though at least a portion of the cell-cycle and ultimately resulting in cell death. The 
hyper-mitotic cell of the subject assay can be generated, for example, by disrupting 
expression of a gene whose product acts antagonistically to one of the subject proteins, by 

25 overexpressing one of the subject proteins, or a combination thereof. In preferred 
' embodiments, the impaired checkpoint of the hyper-mitotic cell would, in normal cells, 
otherwise act as a negative regulator of downstream mitotic events induced by one of the 
regulatory proteins of the present invention. Impairment of such a negative regulator 
consequently allows the cell to proceed aberrantly toward subsequent mitotic stages and 

30 ultimately inhibits faithful proliferation of the cell. In the presence of an agent able to 
inhibit the function of the targeted regulatory protein, progression of the hyper-mitotic cell 
through the cell-cycle can be slowed to enable the cell to appropriately undergo mitosis and 
proliferate with fidelity. 

The present assay therefore provides a simple and rapid screening test which relies on 

35 scoring for positive proliferation as indicative of agents able to inhibit the function of, for 
example, one of the Candida regulatory proteins of the present invention, e.g., TYPl, CDKl, 
CYBl or MOCl . One advantage of the subject assay is that while direct inhibition of growth 
can be caused by any toxic compound added to a proliferating cell culture, growth 
stimulation in the present assay will only be achieved upon specific inhibition of the targeted 
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regulatory protein. Another advantage of the present assay is the amenity of the assay to high 
through-put analysis. 

With regard to the hyper-mitotic cell of the present assay, impairment of the 
regulatory protein can be generated so as to be either continual or conditional. A conditional 
impainnent permits the checkpoint to be noimatively operational under some conditions such 
that the cell may proliferate and be maintained by cell culture techniques; and be rendered 
inoperative, or alternatively hyper-operative, under other conditions. In the instance of the 
illustrative weel-50 mutant described below, the impaired checkpoint is effectively 
inoperative to an extent that the impairment allows aberrant mitosis to occur which concludes 
in mitotic catastrophe. A continual impairment, on the other hand, is one that is ever-present 
and which allows proliferation of the cell under conditions where there is no need to halt the 
cell at that checkpoint; but, in the instance of the hyper-mitotic cell, results in mitotic 
catastrophe under conditions where the cell-cycle must be halted, such as in the piesence of 
DNA synthesis inhibitors or DNA damaging agents. 

Regulatory pathways which feed into and modulate the activity of a CDK, such as 
CDKl, can be manipulated to generate the hyper-mitotic cell of the present assay. For 
example, as set out above, the inhibitory phosphorylation of cyclin-dependent kinases is 
mediated by at least two tyrosine kinases, initially identified in fission yeast and known as 
weel and mikl (Russell et al. (1987) Cell 49:559; Lundgren et al. (1991) Cell 64:111; 
Featherstonecra/. (1991) iVamre 349:808; and Parker a/. (1991) fMBO 10:1255). These 
kinases act as mitotic inhibitors, overexpression of which causes cells to arrest in the G2 
phase of the cell-cycle. For instance, overexpression of weel has been shown to cause 
intense phosphorylation of CDC2 (CDC28 in budding yeast) which results in cell-cycle 
arrest. Conversely, loss of function of weel causes advancement of mitosis and cells enter 
25 mitosis at approximately halfthenonnal size, whereas loss of weel andmikl function causes 
grossly premature initiation of mitosis, uncoupled from all checkpoints that nomially restrain 
cell division. Thus, weel and mikl. or homologs thereof, each represent suitable regulatory 
proteins which could be impaired to generate the hyper-mitotic cell of the present assay. 

Furthermore, it is apparent that enzymes which modulate the activity of the weel or 
mikl kinases can also be pivotal in controlling the precise timing of mitosis. For example, 
the level of the niml/cdrl protein, a negative regulator of the weel protem kinase, can have a 
pronounced impact on the rate of mitotic initiation, and niml mutants have been shown to be 
defective in responding to nutritional deprivation (Russel et al. (1987) Cell 49:569; and 
Feilotter et al (1991) Gertetics 127:309). Over-expression of niml (such as the S. pombe 
35 op-niml mutant) can result in inhibition of the weel kinase and allow premature progression 
into mitosis. In like manner, mutation in the stfl gene has also been shown to relieve 
regulation of mitotic progression in response to DNA synthesis inhibition. 

Loss-of-function strains, such as the S.Pombe weel-SO, or miklr.ura (Rowlev et al. 
(1992) Nature 356:353), are well known. In addition, each of the weel. mikl. and niml 
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genes have been cloned (see for example Coleman et al (1993) Cell 72:919; and Feilotter et 
al. (1991) Genetics 127:309), such that disruption of weel and/or mikl expression or over- 
expression of niml can be carried out to create the hyper-mitotic cell of the present assay. In 
a similar fashion, over-expression of weel and/or mikl or disruption of niml expression can 
5 be utilized to generate a hypo-mitotic cell. 

The hyper-mitotic cell of the present assay can be generated by manipulation of the 
cell in which one of the subject regulatory proteins expressed, as for example, by generating a 
weel mutation (a "wee" phenotype), or by exposure of the cell to 2-aminopurine or caffeine 
after a y-radiation induced G2 arrest It is also deemed to be within the scope of this 

10 invention that the hyper-mitotic cells of the present assay can be generated so as to comprise 
genetically engineered cells which express recombinant (e.g. heterologous) forafis of the 
subject proteins. For instance, each of the subject recombinant TYPl, CDKl, MOCl and 
CYBl genes can be expressed in cells other than Candida, but in which the Candida gene is 
able to rescue lack-of-function mutations of the orthologous activity is the host cell. For 

15 example, the subject TYPl gene can be used to replace the endogenous CDC25 gene of a 
hyper-mitotic Schizosaccharomyces cell, such as an S, pombe cell like the temperature- 
sensitive cdc25'22, weel -50 mutant described below. 

Moreover, in addition to complementation of CDC25-defective cells v^dth the subject 
TYPl, the reagent cells of the subject assay can be further engineered to also express other 

20 exogenous cell-cycle proteins which interact with TYPl, e.g- Candida CDK. In an 
illustrative embodiment, a hyper-proliferative cell in which a Candida TYPl is exogehously 
expressed can also be engineered to produce a Candida CDK (CDKl) and (optionally) a 
Candida cyclin (such as CYB 1 ) and/or a CAK (e.g. MOCl). In this manner, the reagent cells 
of the present assay can be generated to more closely approximate the natural interactions 

25 which the pathogen phosphatase might experience. 

In other embodiments, manipulation of cell-cycle regulatory pathways with certain 
drugs, temied here "hyper-mitotic agents", can induce mitotic aberrations and result in 
generation of the hyper-mitotic .cell of the present assay. For instance, caffeine, the protein 
kinase inhibitors 2-aminopurine and 6-dimethylaminopurine, and the protein phosphatase 

30 inhibitor okadaic acid can cause cells that are arrested in S phase by DNA synthesis 
inhibitors to inappropriately enter mitosis (Schlegel et al (1986) Science 232:1264; Schlegel 
et al (1987) PNAS 84:9025; and Schlegel et al (1990) Cell Growth Differ. 1:171). Further, 
2-aminopurine is believed to be able to override a number of cell-cycle checkpoints from Gl, 
S phase, G2, or mitosis. (Andreassen et al (1992) PNAS 89:2272; Andreassen et ai (1991) 

35 J. Cell Sci. 100:299, and Steinmann et ai (1991) PNAS 88:6843). For example, 2- 
aminopurine permits cells to overconie a G2/M block induced by y-irradiation. Additionally, 
cells continuously exposed to 2-aminopurine alone are able to exit S phase without 
completion of replication, and exit mitosis without metaphase, anaphase, or telophase events. 
The effect of inhibitors of, for example, TYPl function can therefore act to slow the 
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progression of the cell through the cell-cycle and, at appropriate concentrations, offset the 
effects of the hyper-mitotic agent so as to permit cell groxvtK rather than mitotic catastrophe 

Furthermore, to aid in the facilitation of mitotic catastrophe in the hyper-mitotic cell it 
may be desxraWe to expose the cell to an agem (i.e., a chemical or environmental stimulus) 
which ordtnanly induces cell-cycle arrest. Inappropriate exit from the chemically- or 
envtronmentally-induced anested state due to the impairmem of the negative regulatory 
checkpomt can ultimately be lethal to the cell. Such atresting agents can include exposure to 

1^?"'','^''''°" °' ''''^ ^^"^ ^'^^^ synthesis and repair 

using DNA polymerase irAibitors such as hydroxyurea or aphidicolin; topoisomeLe 
inhibitors such as 4'-dimethly-epipodophyllotoxin (VM.26); or agents which interfere with 
mtcrotubule-assembly, such as Nocadazole and taxol. By way of example, the weel-SO 
S.^.^^.e cells described below can be dosed with y-radiation in the presence of either 
caffeme, 2-am,nopunne, or 6-dimethyl-aminopurine. Each of these compounds can suppi^ss 

befor DNA repair has been completed. Inhibition of TYPI activation of a CDK/cyclin 
complex may result in an offsetting effect which slows cell-cycle progression such that, at 
appropnate concentrations, the TYPI inhibitor would rescue the hyper-mitotic cell. 
Add.t.onally. m certain cells, nutritional status of the cell, as well as mating factors, can 
cause arrest of the normal cell during mitosis. 

Agents to be tested for their ability to act as inhibitors can be produced by bacteria 

sucTr ^^^."^^^"^ P-'^""^)' P^^uced chemically (e.g. small molecules.' 

such as peptidomimencs), or produced recombinantly. The assay can be carried out in any 

inhibitors of the subject proteins would be expected to fully inhibit cell-cvcle progressL of 
even the hyper-mitotic cells, it will typically be desirable to perform the assay at various 

"'Tr'''^'' ™' ^^"^^ ^-d'^te agents 

can be added to the hyper-mitotic cell such that at least one concenti^tion tested the anti- 
mitotic agem iiAibits the regulatory protein to an extern necessary to adequately slow the 
Progression of the ce through the cell-cycle, but not to the extent necessary to completely 
inhibil entry of the cell into mitosis all together. 

Quantiftcation of proliferation of ^e hyper-mitotic cell in the presence and absence of 
LT!, agent can be measured t^ing a number of techniques »ell kno™ in the an. 
mclttdtng sunple measurement of population growth curves. For instance, where the assay 
mvolves prohferation in a liquid medium, turbidimetric techniques can be utilized (e.g 
b^rbence/^ansminance of ligh, of a given wavelength through the sample). For examp e. 
n *e embodtmen, wherem the t^ent cell is a yeas, cell, measurement of absorbence of 
h^t at a v^velength between «0 and 600nm can provide a convemently fas, mea^tre of cell 

score for prohferatton. Both of U«se techniques, especially with ,esp«, to yeas, ceUs a« 
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suitable for high through-put analysis necessary for rapid screening of large numbers of 
candidate agents. In addition, the use of solid media, such as agar-based plates, can further 
aid in establishing a serial dilution of the candidate agent. For example, the candidate agent 
can be spotted on a lawn of reagent cells plated on solid media. The diffusion of the 
5 candidate agent through the solid medium surrounding the site at which it was spotted will 
create a difiusional effect. For agents which inhibit the targeted regulatory protein, a halo of 
ceil growth would be expected in an area which corresponds to concentrations of the agent 
ATvliich merely offset the effect of the impaired checkpoint, but which are not so great as to 
over-compensate for the impairment or too little so as to be unable to rescue the cell. 

10 To further illustrate, other proliferative scoring techniques useful in the present assay 

include measuring the mitotic index for untreated and treated cells; uptake of detectable 
nucleotides, amino acids or dyes; as well as visual inspection of morphological details of the 
cell, such as chromatin structure or other features which would be distinguishable between 
cells advancing appropriately through mitosis and cells concluding in mitotic catastrophe or 

1 5 stuck at certain cell-cycle checkpoint. 

Exemplification 

The invention now being generally described, it will be more readily imderstood by 
reference to the following examples which are included merely for purposes of illustration of 
20 certain aspects and embodiments of the present invention, and are not intended to limit the 
invention. 

As described herein, we have isolated and characterized several genes from Candida 
which encode proteins that regulate progress of the Candida cell through mitosis and/or 
meiosis. As described in example 1, a phosphatase, TYPl, was cloned from C albicans^ and 

25 determined to be related to the CDC25 phosphatase gene family. To vaUdate the identity of 
the isolated gene, we demonstrate that it is able to rescue a temperature sensitive allele cdc25' 
22 of fission yeast. To salient features of the Candida TYPI gene are: although the TYPl 
gene has less than 50% homology with yeast cdc25 genes^ and less than 10% homology with 
the human cdc25 genes, the enzyme apparently performs the same function in regulation of 

30 cell cycle progression. Furthermore, despite earlier reports that certain preparations of the 
cdc25 phosphatase would not hydrolyze synthetic substrates in vitro (see Gautier et al. (1991) 
Cell 67:197-211, recombinant forms (including bacterially expressed) of the Candida TYPl 
enzyme are able to hydrolyze such substrates. 

35 Example 1 

Cloning of Candida TYPl 

In order to isolate a gene encoding a Candida TYPl phosphatase, the degenerate 

oligonucleotides ATGGATCCYTTRTANCCNCCRTSNARNANRTANAYNTCNGGRTA, 

ATGGATCCATIATIGAYTGYMGITWYCCITAYGA, and ATGGATCCATIATIGAYTG- 
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YMGITWYGArrAYGA were used to amplify C. alBicans genomic DNA in .ZAP (strain 
.15.A) by standard PGR protocols. The PGR reaction products were separated on a 2 5% 
low meltmg agarose gel that identified a sizable fragment (approximately 250 BP) Tie 
fragment was cloned into the pCRII vector (TA cloning system, Invitrogen) and the 
nudeotide sequence confinned the identity of the insert as a likely TYPl phosphatase DNA 
probes were generated as 32p.iabeled nick translation products of the fragment, and used to 
further screen C albicans cDNA libraries. Larger cDNA clones were isolated by this 
technique, and sequenced. The sequence of the open reading frame of the Candida TYPl 
gene ts gwen in SEQ. ID. No. 1, which also includes both 5' and 3' non-coding sequences 

To validate the identity of the isolated cDNA, the TYPl clone was tested for its 
abt hty to rescue the temperature sensitive allele cdc25.22 of the fission yeast. Briefly a 
1.2kbp EcoRI msert containing most of the open reading frame but lacking the amino part 

PART-TY>1. As described m the literature, e.g. see WO 94/28914 . the pARTl vector 
contams the constitutive ^. pomte ADH promoter, the arsl fragment for r^plification and the 
^. cerev,.ae LEU2 gene as a marker which complemems the Ieul.32 mutant in S. ponrbe 
T^nsfonnants growing on medium lacking leucine were streaked on plates and transferred at 
permtssive temperantre (37-0. It was observed that only the cells expressing the C. albicans 
gene were able to form colonies. Microscopic observations of the cells revealed the rescue 
from the cell elongation typical for thb mutant at restrictive temperature 

The C albicans TYPl cDNA gene was subsequently used to derive a fusion protein 
with glutaUnone-s-transferase in bacterial cells. Briefly, the EcoRI fragment described above 
was cloned mto the EcoRI site of pGEX-4T.l (Pharmacia). Expression of the fusion protein 
n coU was mduced by addition of IPTG (ImM) to the culture medium. After 4 hours of 
Ais regimen, cells were pelleted and resuspended in PBS plus various protease inhibitors 
The cell suspension was then sonicated and centrifuged to pellet the cell debris. TTie soluble 
fraction was collected and analyzed on SDS-PAGE and tested for phosphatase activitv n.e 
expression of the fusion protein was confinned by Western Blot using an anti.GST an'tibody 
As demonstrated m Figures lA and IB. the recombinant Candida TYPl phosphatase Z 
active against both para-nitrophenylphospate and fluorescein diphosphate. 

Example 2 
Cloning of Candida albicans CKSl 
In stoite ftshion .o the cloning of CandUa TYPl gene, a sucl homolog was 

TWOAR°T.VK.^!,'?'t '"^ ■"^•^ ^PMcation using Ae primers 

TWYGARTAYMONCAYOTOATG and AANARNARDATRTGNGGYTC. As above. ,he 
PGR fac,K>ns were separated on an agarose gel, U» ftagnten. elmed, and cloned in,o pCRII. 
DNA probes were generated as ^^?.U>.^^ nick u^ladon products, and used to funher 
screen a C. olUcam cDNA library. Larger cDNA clones isolated by this technit^e we« 
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sequenced. The nucleotide sequence for the CKSl open reading frame, plus flanking 
noncoding sequence, is provided in SEQ. ID. No, 2. 

The CKSl coding sequence was subcloned into a pQE vector (Qiagen), and used to 
produce native proteins. The purified proteins should isolate the Candida CDKl from cell 
5 lysates. 

Example 3 

Cloning of a Candida cyclin-dependent kinase 
Using the degenerate oligonucleotides TCNGGNGCNCKRTACCANARNGT and 
10 GGNGARGGNACNTAYGGNGTNGT, a cyclin-dependent kinase was isolated from a C 
albicans genomic library by PGR. The amplification program consisted of 30 cycles: 94*'C 
for 1 min., SO'^C for 1 min., and 72*'C for 1 min. Two major PGR reaction products were 
generated, separated on an agarose gel, and subsequently cloned into the pCRII vector, 
followed by standard Sanger sequencing. One of the two clones, a 490bp fragment, exhibited 
15 a reasonable degree of similarity with other members of the CDK gene product family and 
was accordingly used to screen a C. albicans cDNA library. 

Purified probes were generated as ^^P-labeled nick translation products, and 
hybridization was performed at 53®C overnight in Church's solution (7% SDS, 250mM NaP 
pH 7's, 10, mM, EDTA, pH7) and filters were washed twice at the same temperature in a 
20 buffer obtaining 2 x SSC and 0.1% SDS. The open reading frame for the cyclin-dependent 
kinase, referred to herein as CDKl, is given in SEQ. ID. No. 3. 

Example 4 

25 Cloning of aC albicans cyclin 

As above, the degenerate primers GANGANYICNGMNGANCCNYTNATG and 
ATNCKNCKNARRAARTTCAT were used to amplify C. albicans genomic DNA. The 
amplification program consisted of 30 cycles: 94**C for 1 minute, 43**C for 1 minute, and 72*" 
C for 1 minute. Two reaction products of about 450 and 700 bp were separated on an agarose 

30 gel- The 450 bp DNA fragment was reamplified and cloned into the pCRII vector and then 
used to screen a C albicans cDNA library. An apparent cyclin B homolog, referred to herein 
as CYBl, was isolated from the cDNA library. The open reading frame for this cyclin is 
given by SEQ. ID. No. 4. 

35 In C albicans and C maltosa^ the CUG codon, which encodes leucine in the 

universal codon usage, is believed to be translated as serine (amino acid residues 301 and 383 
of SEQ ID NO. 4). See, for example, Sugiyama et al. (1995) Yeast 1 1:43-52 and Zimmer et 
al. (1995) Yeast 11:33-41. Accordingly, it will be understood that an equivalent gene for 
expression in other cells csui be modified at these positions to a codon for serine. However, it 

40 is noted that expression of the CYBl gene in S. pombe produced what is apparently a 
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fimctional protein, suggesting that these residues do not effect the biological activity of the 
cyclin, or that Sugiyama et al. were incorrect. 

Sequence CLUSTAL alignment method (Higgins et al. (1992) Comp. Appi Bio-Sci. 
8:189-191) was run on the MegAlign program in the DNAStar package showed that the C 
albicans CYBl gene product is 34.8%, 34.4%, 35.5.%, 33.3%, and 33.7% identical to the ^ 
cerevisiae Clbl, Clb2 (Fitch et al. (1992) Mol Biol. Cell 3:805-818). 5. pombe Cdcl3 
(Booher et al.(1988) EMBO J. 7:2321-2327; Hagan et al. (1988) J. Cell Sci 91-587-595) 
Cig2 (Connolly et al. (1994) Mol. Cell. Biol 14:768-776) and^. nidulans NimE (O'Connell et' 
al. il992) EMBO J. 1 1:2130-2149) proteins, respectively. Percentages of identitv increase up 
to 57% when only the C-terminal pans, containing the cyclin box, of the fungi B-type cyclins 
are aligned. The destruction box (RQYLGDVSN, amino acids 67 to 75 of CYBl ) matches 
perfectly the consensus RxxLxxxxN which is essential for cyclin degradation via the 
ubuqumn pathway (Glotzer et al. {\99\) Nature 349:132-138). The P box. which is required 
for Cdc25 activation by the MPF complex (Galaktionov et al. (1991) Cell 67 1181-1194- 
Zheng et al. (1993) Cell 75:155-164) is also present on the C. albicans Cybl protein (amino 
acids 237 to 268, SEQ ID NO. 4). Cybl P box is 58.8o/„, 64.7o/o, 67.6%, 61.8% and 70.6% 
Identical to the S, cerevisiae Clbl, CIb2, S. pombe Cdcl3, Cig2, and A. nidulans NimE P 
boxes, respectively. 

Example 5 

C albicans CDKl complements the S. pombe cdc2.33 temperature sensitive mutation. 
To test if the CDKl cDNA is a functional gene the full length CDKl cDNA was 
cloned into the 5. pombe pARTl expression vector (McLeod et al. (1987) EMBO J. 6 729- 
736), yielding pCDKl.5. pARTl contains the S. cerevisiae LEU2 gene that complements a 
5. pombe leul-32 mutation, the S. pombe arsl sequence, and the S. pombe adh promoter 
which mitiates strong and constitutive transcription. pCDKl.5 was used to transfonn the 
temperature sensitive S. pombe cdc2-33 strain (Nurse et al. (1976) Nature 146:167-178) 
Transfonnants were obtained at 250C, which is the pemiissive temperature for cdc2-33. 
They were then streaked for single colonies and incubated at 25<'C, SS'C, or 37»C. The C. 
albica,.3 CDKl gene enables a S. pombe cdc2-33 strain to fomi colonies at both 35°C and 37» 
C, however, complementation is not as good as when the S. pombe wild-type cdc2 gene is 
used. 

Example 6 

C. albicans CYBl complements the S. pombe cdcH-U 7 temperature sensitive mutation 
To test if the CYBl cDNA is a functional gene the full length CYBl cDNA was 
cloned mto the S. pombe pARTl expression vector (Mc Leod et al. (1987) EMBO J 6 729- 
736). The resulting plasmid pCYBl.5 was used to transform a temperature sensitive S 
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pombe cdcl3-117 strain (Nasmyth et al. (1981) MoL Gen. Genet. 182:119-124). 
Transformants obtained at 25"* C were then streaked for single colonies and incubated at 25®C, 
SS^'C, or 37''C. The C albicans CYBl gene product is able to rescue a S. pombe cdcl3-l 17 
mutation at 35°C, but no colony formation was observed at ST'^C, indicating a panial rescue. 



Example 7 

Interaction between the CDKl and CYBl proteins 
Using the primers GACCAACACGAATTCCAAATGGTAGAGTTATCTG and 

10 TGAGGAGTCGACCAAGATTTATTGCATG, which contain EcoRI and a Sail restriction 
sites, respectively, the CDKl coding sequence was amplified and subcloned into pEG202 
vector in order to created a CDKl-LexA fusion protein. Likewise, the CYBl coding 
sequence was amplified v^th the oligonucleotides CATTTTGAATTCATAGTA- 
ATGCCACAAGTC and ATAGTCCTCGAGACTTTACTCTTCTGCTTC, cut with EcoRI 

15 and Xhol, and the restriction fragment was subcloned into the vector pJG4-5 (Gyuris et al. 
(1993) Cell 75:791-803) in order to generate a CYB1-VP16 fusion protein. 

The two vectors were used to simultaneously transform the S xerevisiae strain 
YEG048 so as to constitute an interaction trap assay. Analysis of the transformants revealed 
that the CDKl and CYBl proteins interact with one another. 

20 Example 8 

Generation of a TYP Independent hypermitotic cell 
When the TYPl plasmid construct pART-TYPl, described above, is used to 
transform the S. Pombe strain Sp553 (h+N, cdc25'22, weel-50, leul'32) using well known 
procedures. Briefly, cells are grovm in YE medium at 25 °C until they were in exponential 

25 phase (-10'^ cells/ml). The cells are then spun down from the media at 3000rpm for 5 
minutes, and resuspended in LiCl/TE at a concentration of -10^ cells/ml (LiCl/TE=10mM 
Tris, ImM EDTA, 50 mM LiCl, pH 8). The resuspended cells are incubated at room 
temperature for 10 minutes, then spun again at 3000rpm for 5 minutes, resuspended in 
LiCiyTE to a concentration of -5 x 10^ cells/ml, and shaken for 30 minutes at 25°C. 

30 To an aliquot of 150^1 of cells, 500 ng of plasmid DNA and 350^iL of PEG/TE 

(lOmM Tris, ImM EDTA, 50% PEG 4000, pH 8) is added. The celVplasmid mixture is then 
incubated for 30 minutes at 25°C, heat shocked at 42°C for 20 minutes, then spun at 15,000 
rpm for 10 seconds after the addition of 0.5 mL of Edinburgh Minimal Medium (EMM). The 
cells were resuspended in 0.6 mL EMM, and 0.2 mL aliquots were plated. 

35 At the non-permissive temperature of 37°C, both the endogenous weel and CDC25 

activities of the Sp553 cells are impaired such that they mutually off-set each other's effects, 
and the cells are still able to proliferate. However, the effect of expressing the recombinant 
Candida TYPl protein in a yeast "wee" background results in mitotic catastrophe. For 
example, at the peraiissive temperature of 25^C (weel is expressed) the cells are able to 
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proliferate. However, shifting the temDerature to th^ ««« • • 

... ® icmperaiure to the non-permissive temperature of ST'C 

results m mitotic catastrophe. 

Example 9 

Assay for TYPl inhibitors using a hypermitotic cell 
To assay the anti-mitotic activity of various candidate agents, the cells of Example 6 
are e.ther plated on a solid medium such as EMM plates or suspended in an appropriate 
vegetative broth such as YE. k <ui uppropnaie 

loned on o ft= p.a,e, and the pla.e incuba,^ at the non-pcnnissiv. ,emp«a.n. of 3rC A 
.0 ha o of eel, stow* wi„ fo™ surrounding ,ho« agents able to a. fea^ parUally Inhibit^ 
milouc activator wh.ch can rescue the otherwise catastrophic cell 

Wiere growth of a,e cells is carried ou, in a vegetative broth, aiiquots of celWnedia 
^ placed .„ .he wells of .icr^titre plates and serial dilutions of candidate agents are a^ 
W aT '-^"'^ - "-C. and the for each well measured ov^ 

tu™ and con,pa,ed to similar wells of cells/media which lack the candidate agent (eg 
negative controls). An increase in absorbence over time relative to the negaUve conttot 
rnd^a es positive pr„life.Uo„ of the cells and suggests an ability of a particular candid^^e 
agent to inhibit a mitotic activator. 



15 



20 



25 



reference. 



All of the above-cited references and publications are hereby incorporated by 



Equivalents 

Those skilled in the art will recognize or » - • 

r™.,:«» » • • 'ct.ognize, or be able to ascertain using no more than 

routine experimentation, numerous equivalents tn ih^- «o<.^.f-.^ 

. .„ c . . , cHuivaients to the specific assay and reagents described 

rid :t ztrciir" ^ - - — - - 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 (i) APPLICANT: 

(A) NAME; Mitotix, Inc. 

(B) STREET; One Kendall Square, Building 600 

(C) CITY; Cambridge 

(D) STATE: MA 

10 (E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 02139 

(G) TELEPHONE: (617) 225-0001 

(H) TELEFAX: (617) 225-0005 

15 (ii) TITLE OF INVENTION: Cell-Cycle Regulatory Proteins from 

Human Pathogens, and Uses Related Thereto 

(iii) NUMBER OF SEQUENCES: 12 

20 (iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC coti^atible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: ASCII (text) 



25 



40 



55 



<vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/463,090 

(B) FILING DATE: 05-JUN-1995 



30 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY : CDS 

(B) LOCATION: 259.. 1491 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GGATGATACA AATGTGGAAG ATGCAAATTG TTCTTCCCCT ACTTTGATGA GAAAAAGTGC 60 
50 ATTGAGTAAA ATCATCTTCA AAGGACATTA AACAATAATT CCAAATCACC ATCGCCAACT 120 
TTTTCAAATA CAAATGCAAC ATCTGGCTCT CCATTGTCAA ATCTTTCTCG TGGACCATTG 180 
AGAAATTTAT CTAATTTCGT TATTCCTTCG TCAGTTAAAT CAAAAACGAA ACAATTTACA 24 0 



AACTCTTTGA CTCGTtCA ATG ACT GAA GTG GTT TCG AAA TCA TCA CAC TCA 291 

Met Thr Glu Val Val Ser Lys Ser Ser His Ser 
1 5 .10 
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TTT TTC AAT AAT TTG CAT CTT GCA ACC TCA ACT GCG TCT TCT TCA GTA 
Phe Phe Asn Asn Leu His Leu Ala Thr Ser Thr Ala Ser Ser Ser Val 
IS 20 2S 

TCG AGC ACA ACT CCC AAA ATA GAA TTC AAT TCC ATA GCT GAA AAT GAT 
ser ser Thr Thr Pro Lys He Glu Phe Asn Ser He Ala Glu Asn Asp 
2° 35 40 



25 



35 



45 



CTA CAA AAA TTA GAC GGT TCC ACT CCA TGT ACC AAA CCG ATA AAG 
Tyr Leu Gin Lys Leu Asp Gly Ser Thr Pro Cys Thr Lys Pro He Lys 



100 



105 



AGA TTG CAT CGT ACC AAC TTC ATG AAG ATA ATT CAT TTT GAA ATT TAG 
Arg Leu Hxs Arg Thr Asn Phe Met Lys He He His Phe Glu He T^ 
"° "5 120 



135 



GCT CAT CAA TTA GCC AGA ATA TTA CGT GGA GAC CAC GAT GAC CAA TTT 
Ala HXS Gin Leu Ala Arg He Leu Arg Gly Asp His Asp Asp Gin Phe 
j.«u 145 



ISO 



155 



40 2.70 

GGC CAT ATT ACT AGG GCA ATC AAT ATC TCC ACC CAG GAA GCA CTT CAA 
- Gly HXS He Thr Arg Ala He Asn He Ser Thr .Gin Glu Ala Leu ^ 



175 180 



185 



GAA AA. CTC TTT CAA TAT CAA GAA ACA GAT ACC AAG GAC ACT GAA AGC 
Glu Lys Leu Phe Gin Tyr Gin Glu Thr Asp Thr Lys Asp Thr Glu Jer 
"° 195 200 



339 



387 



10 T ?r I K ''''^ "^"^ TTC GAA GAT GGT GAT 435 

10 Asp He Pro Thr Asn Tyr Asp Ser Asp Glu Glu Phe Glu Asp Gly Asp 
45 50 55 



483 



?hr S ?r T f ti"" "^"^ ^'^^ CAA GTA 

Thr Phe He Gin Ser Thr Leu He His Gin Phe Asn Ala Ser Gin Val 

^° «5 70 75 

ACA ACA ACA ACA ATA ATA ATA ATA CCA ATG ATG GTA ACG ACA ATA ATA 531 
Thr Thr Thr Thr He He He He Pro Met Met Val Thr He tie 

20 ^° 90 



579 



S27. 



30 '"'^'^ '^'^ ^T "A CCA CGA ATC GAT 675 

30 Asn He Glu Tyr Ser His Ser Glu Ser Asp Leu Leu Pro Arg He Asp 



723 



GAT GAA TTT ATT ATC ATT GAT TGT CGA TTT GAG TAT GAA TTT AAT GGT 771 
ASP Glu Phe He He He Asp Cys Arg Phe Glu Tyr Glu ^ ^n Sy 
160 



819 



867 



50 fJ« ^ » r° ''T'' ^C AGT ATG TTC AGA GGA 915 

50 Lys Lys Arg Leu He He Phe His Cys Glu Phe Ser Met Phe Arg Gly 
205 210 215 

pS wit mI^ I';'' '^'^ '^'^ ^^-^ AAC TAC 

55 220 225 '"'^ ""^^ ^11 """^ "^^ ""^^ 

2^ ^S"" f ^ """^ ^'^'^ ^"^^ <5CA ATT TTX; GAA GGA 

Asp Asn Tyr Pro Leu Leu Thr Tyr Pro Asp He Ala He Leu Glu Gly 

245 250 



963 



1011 
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GGC TAT AAG AAT TTC TAT GAA AAT TAC CCC CAA TGG TGT GAT CCT CAA 1059 
Gly Tyr Lys Asn Phe Tyr Glu Asn Tyr Pro Gin Trp Cys Asp Pro Gin 
255 260 265 

5 

GGA TAT GTC GAG ATG AAG AAT TTA CGA CAC AAA AAA TTA TGT GAA TCC . 1107 
Gly Tyr Val Glu Met Lys Asn Leu Arg His Lys Lys Leu Cys Glu Ser 
270 275 280 

10 AAC TTG GAT AAA GTT AGA AAA GAT AAT AAA CTA ACT AGA GCA AAG TCT 1155 
Asn Leu Asp Lys Val Arg Lys Asp Asn Lys Leu Thr Arg Ala Lys Ser 
285 290 295 

TAT CAA TTT GOT ATT CAA CAC CGC CGT GGT GGT TCC ACT GGT GGA CTT 1203 
15 Tyr Gin Phe Gly lie Gin His Arg Arg Gly Gly Ser Thr Gly Gly Leu 
300 305 310 315 

TTC GGC AAC TAT AAT TAC AAC GTT ATG AAC TCA TCA GAT CAA CAA TTT 1251 
Phe Gly Asn Tyr Asn Tyr Asn Val Met Asn Ser Ser Asp Gin Gin Phe 
20 320 325 330 

TGG AGT AGC AGT ACT TCC AAC ACT GCT CAC CAC AGA AGT AGT AGC AGT 1299 

Trp Ser Ser Ser Thr Ser Asn Thr Ala His His Arg Ser Ser Ser Ser 
335 340 345 

25 

AGC GGG TTC ATT AAT AAT ATG CAT AGT GGT GCT TCG TCA TAT CAC CAT 1347 

Ser Gly Phe lie Asn Asn Met His Ser Gly Ala Ser Ser Tyr His His 

350 355 360 

30 AGG TCA CAA TCG TTT GTA ACT ATC AAT AAT GAG AAA ATT ATC AAG CGA 1395 
Arg Ser Gin Ser Phe Val Thr lie Asn Asn Glu Lys lie He Lys Arg 
365 370 375 

CAA AGA TCG ACT CCC AAA GTC AGC AAC TCA CCA ACC AAG CCA CCT CAT 1443 
35 Gin Arg Ser Thr Pro Lys Val Ser Asn Ser Pro Thr Lys Pro Pro His 
380 385 390 395 

CAA CTG TAT CTC CTG ATA AAC CCA TTC CGT TGG CTA ATA TTC ATA GAT 1491 
Gin Ser Tyr Leu Ser He Asn Pro Phe Arg Trp Leu He Phe He Asp 
40 400 405 410 

TAACTCGTGC CAACACTATT TCATCAGACC AAACATTGTT TAGCAATAAG CTGGTATCTT 1551 

CCCCAATGAT ATCTCCACTT GCAGCTAGTT TTGAACAATC GTCGATTGGA ATAAGTTCTT 1611 

45 

CTGAATTATC AGTCAATACT CAAGATTTTC AACCACCGAC TACGTCCTTT AGGAATT 1668 



50 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 78 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 208.. 513 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

AACTTGTTTA CTTATTTGTT TATATAATTG ATAGATATCA ATTACTAATT TACCCTTGTT 60 

10 TTTTACTTCC TACTATTCAA GACTXTATTT CCTCCTGATA ATCATTTTGT TTGATTATCA 120 

TTTTCGTCAA TTAGTTCTTT TTTTTCATTT GTTTCCAGAG TTTAGGAAGA CTACCATTTT 180 

ACAATTTTCA ATTCAAATAT TTTCCCA ATG ACT AAA CCA AGA TTT TTA ACA 231 
15 Met Thr Lys Pro Arg Phe Leu Thr 

1 5 

AGA TAT AGA AAG AGC AAA AGT GTT GGA ATT TCA GAT ATG ATC CAT TAC 279 
Arg Tyr Arg Lys Ser Lys Ser Val Gly lie Ser Asp Met lie His Tyr 
20 10 15 20 

AGT CCC AGA TAC AGT GAT GAT TCA TAC GAG TAT AGA CAT GTG ATG TTA 327 

Ser Pro Arg Tyr Ser Asp Asp Ser Tyr Glu Tyr Arg His Val Met Leu 
25 30 35 40 

25 

CCC AAG AAT ATG TTG AAA GCA ATT CCT CAC GAT TAC TTT AAT CAA GAA 375 

Pro Lys Asn Met Leu Lys Ala lie Pro His Asp Tyr Phe Asn Gin Glu 
45 50 55 

30 ACA GGT ACT TTG AGG ATA TTG ACA GAA GAA GAA TGG AGA GGG TTA GGA 423 
Thr Gly Thr Leu Arg He Leu Thr Glu Glu Glu Trp Arg Gly Leu Gly 
60 65 70 

ATC ACA CAA TCT TTG GGT TGG GCC CAT TAC GAA ACT CAT GCT CCA GAG 471 
35 He Thr Gin Ser Leu Gly Trp Ala His Tyr Glu Thr His Ala Pro Glu 
75 80 85 

CCT CAT ATA TTA TTA TTC AAG AGA CCC TTA AAT CCC GGG CAA 513 
Pro His He Leu Leu Phe Lys Arg Pro Leu Asn Pro Gly Gin 
40 90 95 100 

TAAAAAGATT AACTATATTT GAATACTATA GAATCGGAAT CGGTTTTAAA GTTAACACTG . 573 

GAATTAAAAC ATAAAAAGGA AAGAAATAGC CCATTGGTCA CAGTAATCTG TTTCCAACAA 633 

45 

CCCCCCTCCT CAGAAATAGG ATAGAAATGA ATTAACGATG AATTTGTATA CACTATTTAT 693 

AAGCCAATCT CATTGATTGC ATTTCTTATT TGTATATTAT TAAATACGTA TATCGCGAGA 753 

50 AACTGTATAA ATACTCTTGG TACCTCGCAT GTT 786 



(2) INFORMATION FOR SEQ ID NO: 3: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: cDNA 



5 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 43.-993 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



15 



TAGAACACAC ACATCCCAAG CCAAGACCAA CACTTATTGC AA ATG GTA GAG TTA 54 

Met Val Glu Leu 
1 

TCT GAT TAT CAA CGT CAA GAA AAA GTC GGA GAA GGT ACT TAT GGG GTT 102 
Ser Asp Tyr Gin Arg Gin Glu Lys Val Gly Glu Gly Thr Tyr Gly Val 
5 10 . 15 20 

20 GTT TAT AAA GCA TTA GAT ACC AAG CAC AAT AAT AGA GTT GTT GCA TTA 150 
Val Tyr Lys Ala Leu Asp Thr Lys His Asn Asn Arg Val Val Ala Leu 
25 30 35 

AAG AAA ATT CGA TTA GAA TCA GAA GAT GAA GGT GTA CCT AGT ACC GCC 198 
25 Lys Lys lie Arg Leu Glu Ser Glu Asp Glu Gly Val Pro Ser Thr Ala 
40 45 50 

ATT AGA GAA ATC TCG TTA TTA AAA GAA ATG AAA GAT GAT AAT ATC GTT 246 
lie Arg Glu lie Ser Leu Leu Lys Glu Met Lys Asp Asp Asn lie Val 
30 55 60 65 

CGA TTA TAT GAT ATT ATT CAT TCA GAT TCT CAT AAA TTA TAT TTA GTA 294 

Arg Leu Tyr Asp lie lie His Ser Asp Ser His Lys Leu Tyr Leu Val 

70 75 80 

35 

TTT GAA TTT TTG GAT TTA GAT TTA AAG AAA TAT ATG GAA AGT ATT CCT 342 

Phe Glu Phe Leu Asp Leu Asp Leu Lys Lys Tyr Met Glu Ser lie Pro 

85 90 95 100 

40 CAA GGA GTT GGA CTA GGG GCT AAT ATG ATA AAA AGA TTT ATG AAT CAA 3 90 

Gin Gly Val Gly Leu Gly Ala Asn Met lie Lys Arg Phe Met Asn Gin 
105 110 115 

TTA ATT CGA GGT ATT AAA CAT TGT CAT TCT CAT CGA GTT TTA CAT CGT 438 
45 Leu lie Arg Gly lie Lys His Cys His Ser His Arg Val Leu His Arg 
120 125 130 

GAT TTA AAA CCA CAA AAT TTA TTG ATT GAT AAA GAA GGG AAT TTA AAA 486 
Asp Leu Lys Pro Gin Asn Leu Leu lie Asp Lys Glu Gly Asn Leu Lys 
50 135 140 145 

TTA GCA GAT TTT GGA TTA GCT CGA GCA TTT GGA GTT CCA TTA AGA GCA 534 
Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Gly Val Pro Leu Arg Ala 
150 155 160 



55 



TAT ACT CAT GAA GTT GTC ACT TTA TGG TAT CGA GCT CCC GAA ATC TTG 582 
Tyr Thr His Glu Val Val Thr Leu Trp Tyr Arg Ala Pro Glu lie Leu 
165 170 175 180 
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TTA GGA GGG AAA CAA TAT TCC ACT GGG GTA GAT ATG TGG TCT GTT GGA 630 

Leu Gly Gly Lys Gin Tyr Ser Thr Gly Val Asp Met Trp Ser Val Gly 
185 190 ' 195 

TGT ATA TTT GCT GAA ATG TGT AAT AGG AAA CCA TTA TTT CCT GGT GAT 678 
Cys He Phe Ala Glu Met Cys Asn Tlrg Lys Pro Leu Phe Pro Gly Asp 
200 205 210 

TCA GAA ATT GAT GAA ATT TTC CGA ATT TTC CGA ATT TTA GGA ACA CCT 726 
Ser Glu He Asp Glu He Phe Arg He Phe Arg He Leu Gly Thr Pro 
215 220 225 

AAT GAA GAA ATT TGG CCT GAT GTT AAT TAT TTA CCA GAT TTT AAA TCA 774 
Asn Glu Glu He Trp Pro Asp Val Asn Tyr Leu Pro Asp Phe Lys Ser 
230 235 240 

AGT TTC CCT CAA TGG AAA AAG AAA CCT TTG AGT GAA GCA GTT CCA ACT 822 
Ser Phe Pro Gin Trp Lys Lys Lys Pro Leu Ser Glu Ala Val Pro Ser 
245 250 255 260 

TTG GAT GCT AAT GGA ATT GAT CTT TTG GAT CAA ATG TTG GTG TAT GAT 870 
Leu Asp Ala Asn Gly He Asp Leu Leu Asp Gin Met Leu Val Tyr Asp 
265 270 275 

CCA AGT AGA AGA ATA AGT GCT AAA CGA GCT TTA ATT CAT CCT TAT TTT 918 
Pre Ser Arg Arg He Ser Ala Lys Arg Ala Leu He His Pro Tyr Phe 
280 285 290 

AAT GAT AAT GAT GAT CGT GAT CAT AAC AAT TAT AAT GAA GAT AAT ATT 966 
Asn Asp Asn Asp Asp Arg Asp His Asn Asn Tyr Asn Glu Asp Asn He 
295 300 305 

GGG ATT GAC AAA CAC CAA AAC ATG CAA TAAATCTTG 1002 
Gly He Asp Lys His Gin Asn Met Gin 
310 315 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 184 -.1659 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCTATTCCCC CCTTTTCCTT TTTTTTATAG AGAAACTTAT TCCAATTACT CATCGAACAA 60 
GATCTTACTA GACTTGTAGA CTCACGACAC GATAAATTTT AATTCATTAA TCAACCAACG 120 
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AACCAGCCAA ACCAAAATTA ATTCACATTT ATACTCACTG TTTGTCATTT TCATCTCATA 180 

GTA ATG CCA CAA GTC ACT AAA ACT AAT AAT GAA AAT GAG TTT AGA CTT 228 
Met Pro Gin Val Thr Lys Thr Asn Asn Glu Asn Glu Phe Arg Leu 
1.5 10 15 

ACT AGA TCA AAA GTA CAG CAT CAA GAG TCG ATA AGT ACC ATC AAA AAT 276 
Thr Arg Ser Lys Val Gin His Gin Glu Ser lie Ser Tlir lie Lys Asn 
20 25 30 

ACG ACC ATA TCC AAT TCT CAG CAT AAA CAA CAA ACA CAA CAA CAA ATT 324 
Thr Thr lie Ser Asn Ser Gin His Lys Gin Gin Thr Gin Gin Gin lie 
35 40 45 

15 TCA TCA CCA CCT CAA GTC TCT GTA ACA TCA TCT GAA GGA GTT TCA CAT 372 
Ser Ser Pro Pro Gin Val Ser Val Thr Ser Ser Glu Gly Val Ser His 
50 55 60 

GTC AAT ACA CGT CAA TAT TTG GGT GAT GTT TCA AAT CAA TAC ATA ACA 420 
20 Val Asn Thr Arg Gin Tyr Leu Gly Asp Val Ser Asn Gin Tyr lie Thr 
65 70 75 

AAT GCT AAA CCA ACA AAT AAA AGA AAA CCA TTG GGT GGA GAC AAT GCC 468 
Asn Ala Lys Pro Thr Asn Lys Arg Lys Pro Leu Gly Gly Asp Asn Ala 
25 80 85 90 95 

CCT CTA CAA AAA CAA CAG CAT AGA CCA TCT AGA CCA ATA CCC ATT GCC 516 
Pro Leu Gin Lys Gin Gin His Arg Pro Ser Arg Pro lie Pro lie Ala 
100 105 110 

30 

AGT GAT AAC AAC AAT AAT GGT AGT ACC AGT AGC AGT AGC AAC AGT AGC 564 
Ser Asp Asn Asn Asn Asn Gly Ser Thr Ser Ser Ser Ser Asn Ser Ser 
115 120 125 

35 AAC AAC AAT AAC AAC GAC GCA AAT AGA CTA GCA TCT TTG GCA GTT CCA 612 
Asn Asn Asn Asn Asn Asp Ala Asn Arg Leu Ala Ser Leu Ala Val Pro 
130 135 140 

TCT CGA TTA CCC CAA AAA CGA CAA GCT ACT GAA TCG TCG ACA AAT TTA 660 
40 Ser Arg Leu Pro* Gin Lys Arg Gin Ala Thr Glu Ser Ser Thr Asn Leu 
145 150 155 

GTA GAG AAA TTA AGA GTA CCA CAA CCA GAA GTA GGG GAA AGA AGT CAG 708 
Val Glu Lys Leu Arg Val Pro Gin Pro Glu Val Gly Glu Arg Ser Gin 
45 160 165 170 175 

TCA TAC CAT AAG AAA TCA CGT TTA ATT GAT TAT GAA TGG CAG GAT TTG 756 
Ser Tyr His Lys Lys Ser Arg Leu lie Asp Tyr Glu Trp Gin Asp Leu 
180 185 190 

50 

GAT GAA GPA GAT AAT GAC GAC CAA TTA ATG GTT AGT GAA TAT GTT AAC 804 
Asp Glu Glu Asp Asn Asp Asp Gin Leu Met Val Ser Glu Tyr Val Asn 
195 200 205 

55 GAA ATA TTT TCG TAC TAT TAC GAA TTA GAA ACA CGA ATG TTA CCT GAT 852 
Glu lie Phe Ser Tyr Tyr Tyr Glu Leu Glu Thr Arg Met Leu Pro Asp 
210 215 220 

CCG CAA TAT CTT TTC AAA CAA ACA TTG TTA AAA CCA AGA ATG AGA TCG 900 
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Pro Gin Tyr Leu Phe Lys Gin TJir Leu 
225 230 

ATA TTG GTT GAT TGG CTT GTT GAA ATG 
He Leu Val Asp Trp Leu Val Glu Met 
240 245 

CCT GAA TCA CTT TTT TTG GCA GTC AAT 
Pro Glu Ser Leu Phe Leu Ala Val Asn 
260 

GTT GAA GTG GTT CAA ATA GAT AAA TTA 
Val Glu Val Val Gin He Asp Lys Leu 
275 280 



-46- 

Leu Lys Pro Arg Met Arg Ser 
235 

CAT TTA AAA TTC AAG TTA TTA 948 
His Leu Lys Phe Lys Leu Leu 
250 255 

GTA ATG GAT AGA TTC ATG TCT 996 
Val Met Asp Arg Phe Met Ser 
265 270 

CAA TTA TTG GCT ACA GCA GCT 1044 
Gin Leu Leu Ala Thr Ala Ala 
285 



TTA TTT ACT GCT GCC AAA AAT GAA GAA GTA TTT TCT CCC CTG GTT AAA 1092 
Leu Phe Thr Ala Ala Lys Asn Glu Glu Val Phe Ser Pro Ser Val Lys 
290 295 300 



AAT TAT GCA TAT TTC ACT GAT GGT TCA 
Asn Tyr Ala Tyr Phe Thr Asp Gly Ser 
305 310 

CAA GCA GAA AAA TAC ATG CTT ACC ATT 
Gin Ala Glu Lys Tyr Met Leu Thr He 
320 325 

CCC AAT CCA ATG AAT TTC TTG AGA AGA 
Pro Asn Pro Met Asn Phe Leu Arg Arg 
340 

GAT GTC CAA TCA AGA ACG CTA GGA AAA 
Asp Val Gin Ser Arg Thr Leu Gly Lys 
355 360 



TAT ACT CCA GAA GAA GTG GTA 1140 
Tyr Thr Pro Glu Glu Val Val 
315 

CTT AAC TTT GAT TTG AAT TAC 1188 
Leu Asn Phe Asp Leu Asn Tyr 
330 335 

ATT TCT AAA GCT GAT GAT TAT 1236 
He Ser Lys Ala Asp Asp Tyr 
345 350 

TAT CTT TTG GAA ATC ACT ATA 1284 
Tyr Leu Leu Glu He Thr He 
365 



GTT GAT TAC AAA TTT ATT GGT ATG AGA CCA TCT TTA TGT TGT GCC CTG 1332 
Val Asp Tyr Lys Phe He Gly Met Arg Pro Ser Leu Cys Cys Ala Ser 
370 375 380 

GCC ATG TAT TTA GCA AGA CTA ATA TTG GGC AAA TTG CCA GTT TGG AAT 1380 
Ala Met Tyr Leu Ala Arg Leu He Leu Gly Lys Leu Pro Val Trp Asn 
385 390 395 



GGG AAT TTG ATT CAT TAT AGT GGA GGT TAT AGA ATC AGT GAT ATG AGA 1428 
Gly Asn Leu He His Tyr Ser Gly Gly Tyr Arg He Ser Asp Met Arg 
400 405 410 415 



GAA TGT ATC GAA TTA ATG TTT CAA TAT CTT ATT GCT CCT ATA GAA CAT 1476 
Glu Cys He Glu Leu Met Phe Gin Tyr Leu He Ala Pro He Glu His 
420 425 430 



GAT GAA TTT TTC AAA AAA TAT GCC ATG AGA AAA TTT ATG AGA GCA AGT 1524 
Asp Glu Phe Phe Lys Lys Tyr Ala Met Arg Lys Phe Met Arg Ala Ser 
435 440 445 



ACT CTT TGT CGA AAT TGG GCT AAA AAA TTC CAA GCA TCA GGA AGA GAT 1572 
Thr Leu Cys Arg Asn Trp Ala Lys Lys Phe Gin Ala Ser Gly Arg Asp 
450 455 460 
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25 



30 



40 



- 47 • 

TTG TTT GAT GAA CGA TTA TCG ACC CAT AGG CTA ACA TTA GAA GAT GAT 1620 
Leu Phe Asp Glu Arg Leu Ser Thr His Arg Leu Thr Leu Glu Asp Asp 
465 ' 470 475 

GAC GAA GAA GAA GAA ATA GTG GTA GCA GAA GCA GAA GAG TAAAGTTTTG 1669 
Asp Glu Glu Glu Glu He Val Val Ala Glu Ala Glu Glu 
480 485 490 

AGGACTATTG GATCTAGGTT CTTATCTTTA CAATGCATAA ATGAGGAAAT GAAAGAAGAT 1729 
GAACATGAGT TATGTGCATT ACC 1752 



15 (2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 base pairs 

(B) TYPE: nucleic acid 
20 ' (C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 30.. 1058 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATCAAATCCA TCAGAGAACC ACATCAATC ATG TCT ACT GCA GCA GTT GCA ACG' 53 

Met Ser Thr Ala Ala Val Ala Thr 
35 15 



AAA CCA TCT GTC ACT TCA AAA CCA GCA ACT AAA CAA GTT CTG AAT TAC 101 
Lys Pro Ser Val Thr Ser Lys Pro Ala Thr Lys Gin Val Leu Asn Tyr 
10 15 20 

ACC AAA GAA AAA AAA GTA GGG GAA GGT ACA TAT GCT GTT GTG TAC TTG 149 
Thr Lys Glu Lys Lys Val. Gly Glu Gly Thr Tyr Ala Val Val Tyr Leu 
25 30 35 40 

45 GGT AAA CAA ATC TCC ACC AAA CGT CAA ATT GCC ATC AAA GAA ATC AAA 197 
Gly Lys Gin lie Ser Thr Lys Arg Gin lie Ala He Lys Glu He Lys 
45 50 55 

ACA GGA TTA TTC AAA GAT GGG TTG GAT ATG TCA GCA TTG AGA GAA GTG 245 
50 Thr Gly Leu Phe Lys Asp Gly Leu Asp Met Ser Ala Leu Arg Glu Val 
60 65 70 

AAA TAT TTG CAA GAA TTG AAA CAT CCC AAT GTT ATT GAA CTA GTA GAT 293 
Lys Tyr Leu Gin Glu Leu Lys His Pro Asn Val He Glu Leu Val Asp 
55 75 80 85 

GTA TTT TCA GCA ACA AAT AAT TTA AAT TTG GTA TTA GAA TTT CTA CCT 341 
Val Phe Ser Ala Thr Asn Asn Leu Asn Leu Val Leu Glu Phe Leu Pro 
90 95 100 
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TGC GAT TTG GAA GTG TTG ATC AAA GAT AAA TCG ATT GTT TTC AAA TCA 389 
Cys Asp L*eu Glu Val Leu lie Lys Asp Lys Ser lie Val Phe Lys Ser 
105 110 115 120 

GCA GAT ATC AAA TCA TGG CTT TTA ATG ACA TTA CGT GGG ATA CAT CAT 437 
Ala Asp lie Lys Ser Trp Leu Leu Met Thr Leu Arg Gly lie His His 
125 130 135 

TGT CAT CGG AAT TTT ATT TTA CAT CGT GAT TTG AAA CCA AAT AAT TTA 485 
Cys His Arg Asn Phe lie Leu His Arg Asp Leu Lys Pro Asn Asn Leu 
140 145 150 

TTA TTG GCA CCG GAT GGA CAA TTG AAA ATA GCG GAT TTT GGT CTT GCA 533 
Leu Leu Ala Pro Asp Gly Gin Leu Lys lie Ala Asp Phe Gly Leu Ala 
155 160 165 

CGA GCT TTG GTA AAT CCT AAT GAA GAT TTA TCA TCT AAT GTT GTC ACT 581 
Arg Ala Leu Val Asn Pro Asn Glu Asp Leu Ser Ser Asn Val Val Thr 
170 175 180 

AGA TGG TAT AGA GCC CCT GAA TTA TTA TTT GGT GCT CGA CAT TAC ACT 629 
Arg Trp Tyr Arg Ala Pro Glu Leu Leu Phe Gly Ala Arg His Tyr Thr 
185 190 195 200 

GGA GCA GTT GAT ATC TGG TCA ATA GGT ATA ATA TTT GCT GAA TTA ATG 677 
Gly Ala Val Asp lie Trp Ser lie Gly lie He Phe Ala Glu Leu Met 
205 210 215 

CTT CGA ATA CCT TAT TTG CCA GGT AAA GAT GAC GTT GAT CAA TTA GAT 725 
Leu Arg He Pro Tyr Leu Pro Gly Lys Asp Asp Val Asp Gin Leu Asp 
220 225 230 

GTT ACA TTT AGA GCT TAT GGG ACA CCA ACA GAG CAA ATA TGG CCA AAT 773 
Val Thr Phe Arg Ala Tyr Gly Thr Pro Thr Glu Gin He Trp Pro Asn 
235 240 245 

GTT TCC AGT TTG CCA ATG TAT AAT GCA CTT CAT GTG TAT CCA CCT CCT 821 
Val Ser Ser Leu Pro Met Tyr Asn Ala Leu His Val Tyr Pro Pro Pro 
250 255 260 

TCA AGA CAA GAA TTA CGT AAT AGA TTT. AGT GCT GCT ACG GAA AAA GCC 869 
Ser Arg Gin Glu Leu Arg Asn Arg Phe Ser Ala Ala Thr Glu Lys Ala 
265 270 275 280 

CTT GAT TTG TTG ATA TCG ATG ACC CAA TTG GAT CCA AGT AGA AGA TGT 917 
Leu Asp Leu Leu He Ser Met Thr Gin Leu Asp Pro Ser Arg Arg Cys 
285 290 295 

GAT TCT ACA CTA GCA TTA TTA CAC GAT TAT TTT ACT GAA TCG CCT CGT 965 
Asp Ser Thr Leu Ala Leu Leu His Asp Tyr Phe Thr Glu Ser Pro Arg 
300 305 310 

CCT ACT GAC CCG AAA AAG TTG CCT AAA AAG TCT TCT CCA GAA AAG AGA 1013 
Pro Thr Asp Pro Lys Lys Leu Pro Lys Lys Ser Ser Pro Glu Lys Arg 
315 320 325 

GAA AAT GAA GAT GAA CAG AAT AAT GGC TCT AAA AGA AGG CAT GTT 1058 
Glu Asn Glu Asp Glu Gin Asn Asn Gly Ser Lys Arg Arg His Val 
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330 335 340 

TAGGTTTCTA TA 1070 

5 

(2) INFORMATION FOR SEQ ID NO: 6: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 477 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 



15 



20 



55 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NT^E/KEY: CDS 

(B) LOCATION: 1..477 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



TGT TCA GCT ATT GAT ACG AAA ACT TCA GTC TCA GCG ATG GAG CAC AAG 48 
25 Cys Ser Ala lie Asp Thr Lys Ser Ser Val Ser Ala Met Glu His Lys 
15 10 15 

ATT GCT ATA AAG AAA GTA ACA AAG ATT TTC AAC AAA GAC ATC CTT CTA 96 
lie Ala lie Lys Lys Val Thr Lys lie Phe Asn Lys Asp lie Leu Leu 
30 20 25 30 

ATC AGG GCA ATA CGA GAG CTT AAG TTC ATG ATG TTT TTC AGA GGC CAC 144 
lie Arg Ala lie Arg Glu Leu Lys Phe Met Met Phe Phe Arg Gly His 
35 40 45 

35 

AAG AAT ATT GCA ACT TTG CTT GAC TTA GAT GTT GTA TAT GTG AAG CCT 192 
Lys Asn lie Ala Thr Leu Leu Asp Ijeu Asp Val Val Tyr Val Lys Pro 
50 55 60 , 

40 TAT GAA GGC TTG TAT TGT TTT CAA GAG CTA GCC GAT TTA GAT TTA GCT 240 
Tyr Glu Gly Leu Tyr Cys Phe Gin Glu Leu Ala Asp Leu Asp Leu Ala 
65 70 75 . . 80 

CGT GTT TTG TAC TCA AAC GTC CAA TTT TCA GAA TTT CAC ATT CAA AGC 288 
45 Arg Val Leu Tyr Ser Asn Val Gin Phe Ser Glu Phe h£s lie Gin Ser 

85 90 95 

TTT ATG TAC CAA ATT CTT TGC GGA CTC AAG TAC ATC CAT TCT GCT GAT 336 
Phe Met Tyr Gin lie Leu Cys Gly Leu Lys Tyr lie His Ser Ala Asp 
50 100 105 110 

GTA ATA CAT CGG GAC CTA AAG CCA GGA AAC ATA TTG GTC ACC ACT CliA 384 
Val lie His Arg Asp Leu Lys Pro Gly Asn lie Leu Val Thr Thr Gin 
115 120 125 



GGG ACT TTA AAA ATA TGT GAT TTC GGC TTA GCA CGA GGA ATA AAT CCT 432 
Gly Thr Leu Lys lie Cys Asp Phe Gly Leu Ala Arg Gly lie Asn Pro 
130 135 140 
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GTA TAT TTC AGA AAC CGC TCA GCT GTC ATC ACA AAC TAG GTA GCA 4 77 

Val Tyr Phe Arg Asn Arg Ser Ala Val lie Thr Asn Tyr Val Ala 
145 150 155 

5 

(2) IITFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 (xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 7: 

Met Thr Glu Val Val Ser Lys Ser Ser His Ser Phe Phe Asn Asn Leu 
15 10 15 

20 His Leu Ala Thr Ser Thr Ala Ser Ser Ser Val Ser Ser Thr Thr Pro 
20 25 30 



25 



Lys He Glu Phe Asn Ser He Ala Glu Asn Asp Asp He Pro Thr Asn 
35 40 45 

Tyr Asp Ser Asp Glu Glu Phe Glu Asp Gly Asp Thr Phe He Gin Ser 
50 55 60 



Thr Leu He His Gin Phe Asn Ala Ser Gin Val Thr Thr Thr Thr He 
30 65 70 75 80 

He He He Pro Met Met Val Thr Thr He He Tyr Leu Gin Lys Leu 
85 90 BS 

35 Asp Gly Ser Thr Pro Cys Thr Lys Pro He Lys Arg Leu His Arg Thr 
100 105 110 



40 



Asn Phe Met Lys He He His Phe Glu He Tyr Asn He Glu Tyr Ser 
115 120 125 

His Leu Glu Ser Asp Leu Leu Pro Arg He Asp Ala His Gin Leu Ala 
130 135 140 



Arg He Leu Arg Gly Asp His Asp Asp Gin Phe Asp Glu Phe He He 
45 145 150 155 160 

He Asp Cys Arg Phe Glu Tyr Glu Phe Asn Gly Gly His He Thr Arg 
165 170 175 

50 Ala He Asn He Ser Thr Gin Glu Ala Leu Gin Glu Lys Leu Phe Gin 
180 185 190 

Tyr Gin Glu Thr Asp Thr Lys Asp Thr Glu Ser Lys Lys Arg Leu He 
195 200 ' 205 

55 

He Phe His Cys Glu Phe Ser Met Phe Arg Gly Pro Met Met Ala Lys 
210 215 220 

His Leu Arg Lys Cys Asp Arg Met Cys Asn Tyr Asp Asn Tyr Pro Leu 
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225 230 235 240 

Leu Thr Tyx Pro Asp lie Ala He Leu Glu Gly Gly Tyr Lys Asn Phe 
^ 245 250 255 

Tyr Glu Asn Tyr Pro Gin Trp Cys Asp Pro Gin Gly Tyr Val Glu Met 
260 265 270 

Lys Asn Leu Arg His Lys Lys Leu Cys Glu Ser Asn Leu Asp Lys Val 
10 275 280 285 

Arg Lys Asp Asn Lys Leu Thr Arg Ala Lys Ser Tyr Gin Phe Gly He 
290 295 300 

15 Gin His Arg Arg Gly Gly Ser Thr Gly Gly Leu Phe Gly Asn Tyr Asn 
305 310 315 320 

Tyr Asn Val Met Asn Ser Ser Asp Gin Gin Phe Trp Ser Ser Ser Thr 
325 330 335 

20 

Ser Asn Thr Ala His His Arg Ser Ser Ser Ser Ser Gly Phe lie Asn 
340 345 . 350 

Asn Met His Ser Gly Ala Ser Ser Tyr His His Arg Ser Gin Ser Phe 
25 355 360 365 

Val Thr He Asn Asn Glu Lys He He Lys Arg Glh Arg Ser Thr Pro 
370 375 380 

30 Lys Val Ser Asn Ser Pro Thr Lys Pro Pro His Gin Leu Tyr Leu Leu 
385 390 395 400 



35 



He Asn Pro Phe Arg Trp Leu He Phe lie Asp 
405 410 



(2) INFORMATION FOR SEQ ID NO: 8; 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ. ID NO:8: 



Met Thr Lys Pro Arg Phe Leu Thr Arg Tyr Arg Lys Ser Lys Ser Val 
'^5 10 ' 15 

50 

Gly He Ser Asp Met He His Tyr Ser Pro Arg Tyr Ser Asp Asp Ser 

20 ' 25 30 

Tyr Glu Tyr Arg His Val Met Leu Pro Lys Asn Met Leu Lys Ala He 
55 35 40 45 

Pro His Asp Tyr Phe Asn Gin Glu Thr Gly Thr Leu Arg He Leu Thr 
.50 55 60 
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Glu Glu Glu Trp Arg Gly l*eu Gly He Thr Gin Ser Leu Gly Trp Ala 
65 70 75 80 

His Tyr Glu Thr His Ala Pro Glu Pro His He Leu Leu Phe Lys Arg 
85 90 95 

Pro Leu Asn Pro Gly Gin 
100 

(2) INFORMATION FOR SEQ ID NO : 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Val Glu Leu Ser Asp Tyr Gin Arg Gin Glu Lys Val Gly Glu Gly 
1 5 10 15 

25 Thr Tyr Gly Val Val Tyr Lys Ala Leu Asp Thr Lys His Asn Asn Arg 

20 25 30 

Val Val Ala Leu Lys Lys He Arg Leu Glu Ser Glu Asp Glu Gly Val 
35 40 45 

30 

Pro Ser Thr Ala He Arg Glu He Ser Leu Leu Lys Glu Met Lys Asp 
50 55 60 

Asp Asn He Val Arg Leu Tyr Asp He He His Ser Asp Ser His Lys 
35 65 70 75 80 

Leu Tyr Leu Val Phe Glu Phe Leu Asp Leu Asp Leu Lys Lys Tyr Met 
85 90 95 

40 Glu Ser He Pro Gin Gly Val Gly Leu Gly Ala Asn Met He Lys Arg 
100 105 110 

Phe Met Asn Gin Leu He Arg Gly He Lys His Cys His Ser His Arg 
115 120 125 

45 

Val Leu His Arg Asp Leu Lys Pro Gin Asn Leu Leu He Asp Lys Glu 
130 135 140 

Gly Asn Leu Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Gly Val 
50 145 150 155 160 

Pro Leu Arg Ala Tyr Thr His Glu Val Val Thr Leu Trp Tyr Arg Ala 
165 170 175 

55 Pro Glu He Leu Leu Gly Gly Lys Gin Tyr Ser Thr Gly Val Asp Met 
180 185 190 

Trp Ser Val Gly Cys He Phe Ala Glu Met Cys Asn Arg Lys Pro Leu 
195 200 205 
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Phe Pro Gly Asp Ser Glu lie Asp Glu lie Phe Arg lie Phe Arg lie 
210 215 220 

5 Leu Gly Thr Pro Asn Glu Glu lie Trp Pro Asp Val Asn Tyr Leu Pro 
225 230 235 240 

Asp Phe Lys Ser Ser Phe Pro Gin Trp Lys Lys Lys Pro Leu Ser Glu 
245 250 255 

10 

Ala Val Pro Ser Leu Asp Ala Asn Gly lie Asp Leu Leu Asp Gin Met 
260 265 270 

Leu Val Tyr Asp Pro Ser Arg Arg lie Ser Ala Lys Arg Ala Leu lie 
15 275 280 285 

His Pro Tyr Phe Asn Asp Asn Asp Asp Arg Asp His Asn Asn Tyr Asn 
290 295 300 

20 Glu Asp Asn lie Gly lie Asp Lys His Gin Asn Met Gin 
305 310 315 



it) INFORMATION FOR SEQ ID NO: 10: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 - 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

35 Met Pro Gin Val Thr Lys Thr Asn Asn Glu Asn Glu Phe Arg Leu Thr 
1 5 10 15 

Arg Ser Lys Val Gin His Gin Glu Ser lie Ser Thr lie Lys Asn Thr 
20 25 30 

Thr lie Ser Asn Ser Gin His Lys Gin Gin Thr Gin Gin Gin lie Ser 
35 40 45 



40 



Ser Pro Pro Gin Val Ser Val Thr Ser Ser Glu Gly Val Ser His Val 
45 50 55 60 

Asn Thr Arg Gin Tyr Leu Gly Asp Val Ser Asn Gin Tyr lie Thr Asn 
65 70 75 80 

50 Ala Lys Pro Thr Asn Lys Arg Lys Pro Leu Gly Gly Asp Asn Ala Pro 

85 90 95 

Leu Gin Lys Gin Gin His Arg Pro Ser Arg Pro lie Pro lie Ala Ser 
100 105 110 

55 

Asp Asn Asn Asn Asn Gly Ser Thr Ser Ser Ser Ser Asn Ser Ser Asn 
115 120 125 

Asn Asn Asn Asn Asp Ala Asn Arg Leu Ala Ser Leu Ala Val Pro Ser 
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130 135 140 

Arg Leu Pro Gin Lys Arg Gin Ala Thr Glu Ser Ser Thr Asn Leu Val 
145 150 155 160 

Glu Lys Leu Arg Val Pro Gin Pro Glu Val Gly Glu Arg Ser Gin Ser 
165 170 175 

Tyr His Lys Lys Ser Arg Leu lie Asp Tyr Glu Trp Gin Asp Leu Asp 
180 185 190 

Glu Glu Asp Asn Asp Asp Gin Leu Met Val Ser Glu Tyr Val Asn Glu 
195 200 205 

lie Phe Ser Tyr Tyr Tyr Glu Leu Glu Thr Arg Met Leu Pro Asp Pro 

210 215 220 

Gin Tyr Leu Phe Lys Gin Thr Leu Leu Lys Pro Arg Met Arg Ser lie 
225 230 235 240 

Leu Val Asp Trp Leu Val Glu Met His Leu Lys Phe Lys Leu Leu Pro 
245 250 255 

Glu Ser Leu Phe Leu Ala Val Asn Val Met Asp Arg Phe Met Ser Val 
260 265 270 

Glu Val Val Gin lie Asp Lys Leu Gin Leu Leu Ala Thr Ala Ala Leu 
275 280 285 

Phe Thr /Via Ala Lys Asn Glu Glu Val Phe Ser Pro Leu Val Lys Asn 
290 295 300 

Tyr Ala Tyr Phe Thr Asp Gly Ser Tyr Thr Pro Glu Glu Val Val Gin 
305 310 315 320 

Ala Glu Lys Tyr Met Leu Thr lie Leu Asn Phe Asp Leu Asn Tyr Pro 
325 330 335 

Asn Pro Met Asn Phe Leu Arg Arg lie Ser Lys Ala Asp Asp Tyr Asp 
340 345 350 

Val Gin Ser Arg Thr Leu Gly Lys Tyr Leu Leu Glu lie Thr lie Val 
355 360 365 

Asp Tyr Lys Phe lie Gly Met Arg Pro Ser Leu Cys Cys Ala Leu Ala 
370 375 380 

Met Tyr Leu Ala Arg Leu lie Leu Gly Lys Leu Pro Val Trp Asn Gly 
385 390 395 400 

Asn Leu lie His Tyr Ser Gly Gly Tyr Arg He Ser Asp Met Arg Glu 
405 410 415 

Cys He Glu Leu Met Phe Gin Tyr Leu He Ala Pro He Glu His Asp 
420 425 430 

Glu Phe Phe Lys Lys Tyr Ala Met Arg Lys Phe Met Arg Ala Ser Thr 
435 440 445 
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Leu Cys Arg Asn Trp Ala Lys Lys Phe Gin Ala Ser Gly Arg Asp Leu 
450 455 460 

Phe Asp Glu Arg Leu Ser Thr His Arg Leu Thr Leu Glu Asp Asp Asp 
5 465 470 475 480 

Glu Glu Glu Glu lie Val Val Ala Glu Ala Glu Glu 
485 490 



10 



(2) INFORMATION FOR SEQ ID NO: IX: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 343 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOrll: 



Met Ser Thr Ala Ala Val Ala Thr Lys Pro Ser Val Thr Ser Lys Pro 
1 5 . 10 15 

25 

Ala Thr Lys Gin Val Leu Asn Tyr Thr Lys Glu Lys Lys Val Gly Glu 
20 25 30 

Gly Thr Tyr Ala Val Val Tyr Leu Gly Lys Gin lie Ser Thr Lys Arg 
30 35 40 45 

Gin lie Ala lie Lys Glu lie Lys Thr Gly Leu Phe Lys Asp Gly Leu 
50 55 60 

35 Asp Met Ser Ala Leu Arg Glu Val Lys Tyr Leu Gin Glu Leu Lys His 
65 70 75 80 

Pro Asn Val lie Glu Leu Val Asp Val Phe Ser Ala Thr Asn Asn Leu 
85 90 95 

40 

Asn Leu Val Leu Glu Phe Leu Pro Cys Asp Leu Glu Val Leu lie Lys 
100 105 110 

Asp Lys Ser lie Val Phe Lys Ser Ala Asp lie Lys Ser Trp Leu Leu 
45 115 120 125 

Met Thr Leu Arg Gly lie His His Cys His Arg Asn Phe lie Leu His 
130 135 140 

50 Arg Asp Leu Lys Pro Asn Asn Leu Leu Leu Ala Pro Asp Gly Gin Leu 
145 150 155 160 

Lys lie Ala Asp Phe Gly Leu Ala Arg Ala Leu Val Asn Pro Asn Glu 

165 ' 170 175 

55 

Asp Leu Ser Ser Asn Val Val Thr Arg Trp Tyr Arg Ala Pro Glu Leu 
180 185 190 - 

Leu Phe Gly Ala Arg His Tyr Thr Gly Ala Val Asp lie Trp Ser lie 
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195 200 205 

Gly He He Phe Ala Glu Leu Met Leu Arg He Pro Tyr Leu Pro Gly 
210 215 220 

5 

Lys Asp Asp Val Asp Gin Leu Asp Val Thr Phe Arg Ala Tyr Gly Thr 
225 230 235 240 

Pro Thr Glu Gin He Trp Pro Asn Val Ser Ser Leu Pro Met Tyr Asn 
10 245 250 255 

Ala Leu His Val Tyr Pro Pro Pro Ser Arg Gin Glu Leu Arg Asn Arg 
260 265 270 

15 Phe Ser Ala Ala Thr Glu Lys Ala Leu Asp Leu Leu He Ser Met Thr 
275 280 285 

Gin Leu Asp Pro Ser Arg Arg Cys Asp Ser Thr Leu Ala Leu Leu His 
290 295 300 

20 

Asp Tyr Phe Thr Glu Ser Pro Arg Pro Thr Asp Pro Lys Lys Leu Pro 
3C5 310 315 320 

Lyc Lys Ser Ser Pro Glu Lys Arg Glu Asn Glu Asp Glu Gin Asn Asn 
25 325 330 335 

Gly Ser Lys Arg Arg His Val 
340 

30 (2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 



40 



Cys Ser Ala He Asp Thr Lys Ser Ser Val Ser Ala Met Glu His Lys 
*1 5 10 • 15 



He Ala He Lys Lys Val Thr Lys He Phe Asn Lys Asp He Leu Leu 
45 20 25 30 

He Arg Ala He Arg Glu Leu Lys Phe Met Mec Phe Phe Arg Gly His 
35 40 45 

50 Lys Asn He Ala Thr Leu Leu Asp Leu Asp Val Val Tyr Val Lys Pro 
50 55 60 

Tyr Glu Gly Leu Tyr Cys Phe Gin Glu Leu Ala Asp Leu Asp Leu Ala 
65 70 75 80 



55 



Arg Val Leu Tyr Ser Asn Val Gin Phe Ser Glu Phe His He Gin Ser 
85 90 95 
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Phe Met Tyr Gin lie Leu 
100 

Val He His Arg Asp Leu 
5 115 

Gly Thr Leu Lys lie Cys 
130 

10 Val Tyr Phe Arg Asn Arg 
145 150 
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Cys Gly Leu Lys Tyr He 
105 

Lys Pro Gly Asn He Leu 
120 

Asp Phe Gly Leu Ala Arg 
135 140 

Ser Ala Val He Thr Asn 
155 



PCT/US96/08807 

His Ser Ala Asp 
110 

Val Thr Thr Gin 
125 

Gly He Asn Pro O 
Tyr Val Ala 
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Claims 

1 . A substantially pure preparation of a Candida TYPl polypeptide. 

5 2. The TYPl polypeptide of claim 1, wherein the TYPl polypeptide comprises an ammo 
acid sequence at least 75 percent homologous to an amino acid sequence represented in 
SEQIDNo.2. 

3. The TYPl polypeptide of claim L which polypeptide functions in one of either role of 
1 0 an agonist or an antagonist of cell cycle regulation of a Candida cell. 

4. The TYPl polypeptide of claim 1, which polypeptide has an intrinsic phosphatase 
activity. 

15 5. The TYPl polypeptide of claim 4, wherein the phosphatase activity of the TYPl 
polypeptide activates a Candida cyclin dependent kinase. 

6. An inununogen comprising the polypeptide of claim 1, in an immunogenic preparation, 
said immunogen being capable of eliciting an immune response specific for the 

20 Candida TYPl polypeptide. 

7. An antibody preparation specifically reactive with the polypeptide of claim 1 . 

8. A recombinantly produced Candida TYPl polypeptide. 



25 



30 



9. The TYPl polypeptide of claim 8, having an amino acid sequence at least 75 percent 
homologous to an amino acid sequence designated by SEQ ID No, 2. 

10. The TYPl polypeptide of claim 8, which polypeptide is a protein phosphatase. 

11. The TYPl polypeptide of claim 8, which polypeptide is a fusion protein further 
comprising a second polypeptide portion having an amino acid sequence from a protein 
unrelated to the Candida TYPl polypeptide. 

12. The TYPl polypeptide of claim 8, which polypeptide hydrolyzes para-nitrophenyl 
phosphate. 

13. The TYPl polypeptide of claim 8, which polypeptide binds to a Candida cyclin- 
dependent kinase. 
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14. A substantially pure nucleic acid comprising a nucleotide sequence which encodes a 
TYPl polypeptide at least 75% homologous to an aniino acid sequence represented in 
SEQ ID No. 2. 

15. The nucleic acid of claim 14, wherein the encoded TYPl polypeptide functions in one 
5 of either role of an agonist or an antagonist of cell cycle regulation of a Candida cell. 

16. The nucleic acid of claim 14, wherein the encoded TYPl polypeptide has an intrinsic 
phosphatase activity. 

17. The nucleic acid of claim 16, wherein the phosphatase activity of the TYPl polypeptide 
activates a Candida cyclin dependent kinase. 

10 18. The nucleic acid of claim 16, wherein the phosphatase activity of the TYPl polypeptide 

hydrolyzes para-nitrophanylphosphate. 
19. The nucleic acid of claim 14, which nucleic acid further comprises a transcriptional 

regulatory sequence operably linked to said nucleotide sequence so as to render said 

nucleotide sequence suitable for use as an expression vector. r 
15 20. An expression vector, capable of replicating in at least one of a prokaryotic cell and 

eukaryotic cell, comprising the nucleic acid of claim 14. 

21. A host cell iransfecied with the expression vector of claim 20. 

20 22. A method of producing a recombinant Candida TYPl protein comprising culturing the 
cell of claim 21 in a cell culture medium to express said TYPl protein and isolating said 
TYPl protein from said cell culture. 

23. A probe/primer for identifying nucleic acid encoding a regulatory protein of a Candida 
25 cell, which probe/primer comprises a nucleotide sequence which hybridizes under 

stringent conditions to at least 10 consecutive nucleotides of sense or antisense 
sequence of a nucleic acid selected from a group consisting of SEQ ID No. 1, SEQ ID 
No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5 and SEQ ID No. 6. 

30 24. The probe/primer of claim 23, further comprising a label group anached thereto and 
able to be detected. 

25. A diagnostic test kit for identifying nucleic acid of a Candida organism, comprising the 
probe/primer of claim 23, for measuring a level of a nucleic acid encoding the 

35 regulatory' protein in a biological sample. 

26. A method of identifying a compound which is an inhibitor of TYPl phosphatase, 
comprising the steps of: 

a) generating a combination including: 
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1 ) a test agent to be assessed; 

2) a cell free preparation of a TYPl phosphatase from Candida, and 

3) a substrate of the TYPl phosphatase, other than an active cyclin 
dependent kinase (CDK); 

5 b) maintaining the combination imder conditions appropriate for the TYPl 

phosphatase to convert the substrate to product; and 
c) measuring the conversion of the substrate to product, 
wherein a statistically significant decrease in the conversion of substrate to product in 
the combination, relative to a control comprising TYPl phosphatase and the substrate 
10 and lacking the test agent, indicates that the test compound is an inhibitor of the TYPl 

phosphatase. 

27. The method of claim 26, wherein the TYPl phosphatase is a component of a fusion 
protein. 

15 

28. The method of claim 27, wherein the fusion protein is a glutathione-S-transferase/TYPl 
phosphatase fusion protein. 

29. The method of claim 26, wherein the conversion of substrate to product provides a 
20 colorimetric indicator of phosphatase activity. 

30. The method of claim 29, wherein the substrate is a synthetic substrate of TYPl 
phosphatase comprising a colorimetric label which is detectable when the substrate is 
converted to product. 



25 



30 



31, The method of claim 30, wherein the substrate is selected firom the group consisting of 
p-nitrophenylphosphate (pNPP), fluorosceindiphosphate (FDP), and 3-0- 
methylfluoroscein phosphate (3-MFP). 

32. The method of claim 26, wherein the TYPl phosphatase comprises a polypeptide 
having an amino acid sequence represented in SEQ ID No. 2. 



An assay for screening test agents for an inhibitor of an interaction of a cyclin 
dependent kinase (CDK) with a TYPl polypeptide, comprising: 
35 i) generating a combination including: 

a) a test agent to be assessed; 

b) a cell free preparation of a TYPl poK'peptide from Candida: and 

c) a cyclin dependent kinase; 
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ii) detecting the formation of a complex including said CDK and said TYPl 
polypeptide, 

wherein a statistically significant decrease in the formation of said complex in the 
presence of said test agent, relative to the formation of a CDK/TYPl complex in the 
absence of said test agent, is indicative of said test agent being an inhibitor of the 
interaction between said CDK and said TYPl polypeptide. 

34. A method for screening test agents for an inhibitor of an interaction of a cyclin 
dependent kinase (CDK) with a TYPl polypeptide, comprising: 

i) providing an interaction trap system including 

a) a first fusion protein comprising a cyclin-dependent kinase (CDK) 
portion, and 

b) a second fusion protein comprising a Candida TYPl protein 
portion, 

c) maintaining the interaction trap system under conditions wherein 
said interaction trap system is sensitive to interactions between the 
CDK portion of said first fusion protein and said TYPl protein 
portion of said second polypeptide; 

ii) contacting said interaction trap assay with a test agent; 

iii) measuring the interactions between said fusion proteins in the presence of 
said candidate agent; and 

iv) comparing the interactions of said fusion proteins in the presence of said 
candidate agent to interactions of said fusion proteins in the absence of the 
candidate agent, 

wherein a statistically significant decrease in the level of interaction of the fusion 
proteins in the presence of said candidate agent is indicative of the test agent being an 
inhibitor of interactions between CDK and the TYPl protein. 

35. An assay for identifying an inhibitor of a pathogen TYPl phosphatase, comprising 

i. providing a cell expressing a recombinant TYPl phosphatase firom Candida^ 
said cell also having an impaired checkpoint which causes premature entry 
of the cell into mitosis resulting in cell death, the premature entry into 
mitosis being mediated at least in part by the TYPl phosphatase; 

ii. contacting the cell with a candidate agent; 

iii. measuring a level of proliferation of the cell in the presence of the candidate 
agent; and 

iv. comparing the level of proliferation of the cell in the presence of the 
candidate agent to a level of proliferation of the cell in the absence of the 
candidate agenL 
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wherein a statistically significant increase in the level of proliferation in the presence of 
the candidate agent is indicative of inhibition of the TYPl phosphatase by the candidate 
agent. 

5 36. The assay of claim 35; wherein the cell-cycle checkpoint impairment comprises a 
reduction of inhibitory phosphorylation of a cyclin-dependent kinase (CDK). 

37. The assay of claim 36, wherein the cell-cycle checkpoint impairment comprises an 
impaired weel protein kinase activity, an impaired mikl protein kinase activity, or an 

10 over-expression of a niml gene product. 

38. A Schizosaccharomyces cell comprising 

i). an expressible recombinant gene encoding an exogenous TYPl phosphatase 
from Candida', and 

15 ii). a conditionally impairable weel protein kinase which can cause cell death by 

facilitating premature entry of the cell into mitosis under conditions wherein 
the weel protein kinase is impaired, the premature entry into mitosis being 
mediated at least in part by the exogenous TYPl phosphatase and a reduced 
level of inhibitory phosphorylation of a cdc2 protein kinase by the impaired 

20 weel protein kinase. 

39. The Schizosaccharomyces cell of claim 38, wherein the weel protein kinase is 
temperature sensitive and is impaired at a temperature above a permissive temperature. 
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Figure lA 



Lineweaver-Burke Analysis of Candida Cdc25 with p-NPP 
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